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herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 
This is a FIRST submission of items concerning a filing under 35 U.S.C. 371. 
This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371 . 

S iS is ^^ X P r 5 SS TQ( \ u ? s l t0 begin national examination Procedures (35 U.S.C. 371(f)). The submission must include itens (5), (6) 
(9) and (24) indicated below. 1 ;>v ; ' 

The US has been elected by the expiration of 19 months from the priority date (Article 31). 
A copy of the International Application as filed (35 U.S.C. 371 (c) (2)) 

a. E9 is attached hereto (required only if not communicated by the International Bureau). 

b. KJ has been communicated by the International Bureau. 

c. □ is not required, as the application was filed in the United States Receiving Office (RO/US). 
An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)). 

a. □ is attached hereto. 

b. □ has been previously submitted under 35 U.S.C. 154(d)(4). 

Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371 (c)(3)) 

a. □ are attached hereto (required only if not communicated by the International Bureau). 

b. □ have been communicated by the International Bureau. 

c. □ have not been made; however, the time limit for making such amendments has NOT expired. 

d. □ have not been made and will not be made. 

An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)). 
An oath or declaration of the inventor(s) (35 U.S.C. 371 (c)(4)). 

An English language translation of the annexes of the International Preliminary Examination Report under PCT 
Article 36 (35 U.S.C. 371 (c)(5)). 

A copy of the International Preliminary Examination Report (PCT/IPEA/409). 
A copy of the International Search Report (PCT/ISA/210). 
13 to 20 below concern documents) or information included: 
An Information Disclosure Statement under 37 CFR 1.97 and 1.98. 

An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.3 1 is included. 
A FIRST preliminary amendment. 
A SECOND or SUBSEQUENT preliminary amendment. 
A substitute specification. 

A change of power of attorney and/or address letter. 

A computer-readable form of the sequence listing in accordance with PCT Rule 13ter.2 and 35 U.S.C. 1.821 - 1.825. 
A second copy of the published international application under 35 U.S.C. 154(d)(4). 
A second copy of the English language translation of the international application under 35 U.S.C. 154(d)(4). 
Certificate of Mailing by Express Mail 
Other items or information: 

copy of this transmittal letter for charging purposes 
return postcard 
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^PLICATION NO. (IF KNOWN, SEE 37 C^R 



INTERNATIONAL APPLICATION NO. 
PCT/GBOO/00916 



24. The following fees are suBmittecf - 

BASIC NATIONAL FEE ( 37 CFR 1.492 (a) (1) - (5)) : 

□ Neither international preliminary examination fee (37 CFR 1.482) nor 
international search fee (37 CFR 1.445(a)(2)) paid to USPTO 

and International Search Report not prepared by the EPO or JPO $1000.00 



□ 
□ 



International preliminary examination fee (37 CFR L482) not paid to 
USPTO but International Search Report prepared by the EPO or JPO 

International preliminary examination fee (37 CFR 1.482) not paid to USPTO 
but international search fee (37 CFR 1.445(a)(2)) paid to USPTO 

International preliminary examination fee (37 CFR 1.482) paid to USPTO 
but all claims did not satisfy provisions of PCT Article 33(l)-(4) . 



□ International preliminary examination fee (37 CFR 1 .482) paid to USPTO 
and all claims satisfied provisions of PCT Article 33(l)-(4) . 



$860.00 



$710.00 



$690.00 



$100.00 



ATTORNEY'S DOCKET NUMBER 
PA-9912 



CALCULATIONS PTO USE ONLY 



ENTER APPROPRIATE BASIC FEE AMOUNT = 



$860.00 



Surcharge of $130.00 for furnishing the oath or declaration later than 
months from the earliest claimed priority date (37 CFR 1.492 (e)). 



□ 20 □ 30 



$0.00 



CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



Total claims 



24 



-20 = 



x $18.00 



$72.00 



Independent claims 



3- 



x $84.00 



$0.00 



Multiple Dependent Claims (check if applicable). 



□ 



$0.00 



TOTAL OF ABOVE CALCULATIONS = 



$932.00 



□ Applicant claims small entity status. (See 37 CFR 1.27). The fees indicated above are 
reduced by 1/2. 



$0.00 



SUBTOTAL = 



$932.00 



Processing fee of $130.00 for furnishing the English translation later than 
months from the earliest claimed priority date (37 CFR 1.492 (f)). 



□ 20 □ 30 



$0.00 



TOTAL NATIONAL FEE = 



$932.00 



Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31) (check if applicable). 



□ 



$0.00 



TOTAL FEES ENCLOSED = 



$932.00 



Amount to be: 
refunded 



charged 



a. 
b. 



d. 



□ 

□ 



A check in the amount of 



to cover the above fees is enclosed. 

500-588 in the amount of $932.00 



to cover the above fees. 



Please charge my Deposit Account No. _ 
A duplicate copy of this sheet is enclosed. 

The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any overpayment 
to Deposit Account No. 500-588 A duplicate copy of this sheet is enclosed. 

Fees are to be charged to a credit card. WARNING: Information on this form may become public. Credit card 
information should not be included on this form. Provide credit card information and authorization on PTO-2038. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
% 137(a) or (b)) must be filed and granted to restore the application to pending status. 

SEND ALL CORRESPONDENCE TO: 



Royal N. Ronning, Jr. 
Amersham Pharmacia Biotech, Inc. 
800 Centennial Avenue 
Piscataway, New Jersey 08855 

(732) 457-8423 




Royal N. Ronning, Jr. 



NAME 
32,529 



REGISTRATION NUMBER 

August 28, 2001 

DATE 
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PA-9912 

UNITED STATES PATENT AND TRADEMARK OFFICE 

M. Reeve, et al. Group Art Unit: To be assigned 

09/914,604 Examiner: To be assigned 

August 28, 2001 
Genetic Analysis 

SECOND PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Box PCT 

Washington, D.C. 20234 
Sir: 

Please consider the following amendments and remarks in connection with the 
prosecution of the captioned application. 

In the Specification 

At the end of the written description, before the claims, please insert the Sequence Listing 
attached hereto. 

Remarks 



Applicants have amended the specification by insertion of a sequence listing. The 
sequence listing has also been submitted under separate cover in an initial computer 
readable form as well as on a paper copy in response to the "Notification of Missing 
Requirements Under 35 U.S.C. 371 in the United States Designated/Elected Office 
(DO/EO/US)" having a mailing date of October 25, 2001 . 



IN THE 

Application of: 
Serial Number: 
Filing Date: 
Title: 



Early and favorable consideration is requested. 



Amersham Biosciences 
800 Centennial Avenue 
P. 0. Box 1327 

Piscataway, New Jersey 08855-1327 

Tel: (732)457-8071 
Fax: (732) 457-8463 



Respectfully submitted, 




H:\DEPT\Law_Dept\IP Dept\Amendments\Second Preliminary Amendment-PA9912 (1 l-6-01).doc 



SEQUENCE LISTING 



<110> Reeve, Michael Alan 

Workman, Nicholas Ian 
Martin- Parras, Luis 

<120> Genetic Analysis 

<130> PA9912 

<140> 09/914,604 
<141> 2001-08-28 

<150> PCT/GB00/00916 
<151> 2000-03-10 

<150> EPO 99301933.0 
<151> 1999-03-12 

<160> 32 

<170> Patentln Ver, 2.1 

<210> 1 
<211> 3954 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Plasmid 

<400> 1 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 12 0 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 18 0 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 3 00 
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgagctcggt accgggcccc 42 0 
ccctcgaggt cgacggtatc gataagcttg atcgcagctg gtaatccgga cgcccgcgtc 480 
gaagatgttg gggtgttgta acaatatcga ttccaattca gcgggggcca cctgatatcc 540 
tttgtattta attaaagact tcaagcggtc aactatgaag aagtgttcgt cttcgtccca 600 
gtaaggatcc gcactttgaa ttttgtaatc ctgaagggat cgtaaaaaca gctcttcttc 660 
aaatctatac attaagacga ctcgaaatcc acatatcaaa tatccgagtg tagtaaacat 720 
tccaaaaccg tgatggaatg gaacaacact taaatgtaca ccctggtaat ccgttttaga 7 80 
atccatgata ataattttct ggattattgg taattttttt tgcacgttca aaattttttg 840 
caaccccttt ttggaaacaa acactacggt aggctgcgaa atgttcatac tgttgagcaa 900 



1 



ttcacgttca ttataagctt ttcactgcat acgacgattc tgtgatttgt attcagccca 960 
tatcgtttca tagcttctgc caaccgaacg gacatttcga agtattccgc gtacgtgatg 1020 
ttcacctcga tatgtgcatc tgtaaaagca attgttccag gaaccagggc gtatctcttc 108 0 
atagccatgg aatacgcctt tttcagtgtt gcgatgctaa tgccgttaca aataatccga 1140 
gcaccaagaa tggctgcgcg cttgcctggt acttgacgtc gtatttgacg gggtccttga 12 00 
gaaagtattt aaactggaac acaatctgag gaatgatcaa agcaaccaac gccaacgcat 1260 
aataactagt gcaataccaa gacctcccaa taatagcacc cagacttgtg taataacctc 1320 
tggctctgat attgctccag atggaattgg acgatatggc tcattaattg cgtcgatatc 138 0 
tctatcatac cagtcgttga ttgtctgtgt atagccagta agacaaggac cagacatcat 1440 
catgcaaaga atcgcttaag cccttcttgg cctttatgag gatctctctg atttttcttg 1500 
cgtcgagttt tccggtaaga cctttcggta cttcgtccac aaacacaact cctccgcgca 1560 
actttttcgc ggttgttact tgactggcca cgtaatccac gatctctttt tccgtcatcg 162 0 
tctttccgtg ctccaaaaca acaacggcgg cgggtccgga ttaccagctg cgatcaagct 1680 
tatcgatacc gtcgacctcg acctgcaggc atgcaagctt ggcgtaatca tggtcatagc 1740 
gttttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 18 00 
taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 1860 
cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 1920 
gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 1980 
tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 2040 
tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 2100 
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 2160 
agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 2220 
accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 2280 
ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 2340 
gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 2400 
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 2460 
gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 252 0 
taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag 2580 
tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 2 64 0 
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 2700 
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 2760 
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 282 0 
cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 2880 
cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 294 0 
ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 3000 
taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 3 060 
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 3120 
ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 3180 
atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 3240 
gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 3300 
tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 3360 
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 3420 
taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 3480 
ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 3 54 0 
ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 3600 
cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 3660 
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 3 720 
gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 3 780 



2 



gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 3840 
aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca 3900 
ttattatcat gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtc 3954 

<210> 2 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 2 

cgcagctggt aatccggacg cccgcgtcga agatgtt 3 7 

<210> 3 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 3 

cgcagctggt aatccggacc cgccgccgtt gttgtt 3 6 

<210> 4 
<211> 2946 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Plasmid 

<400> 4 

atgaccatga ttacgccaag ctctaatacg actcactata gggaaagctt ccggacgtct 60 
caggctaatg ttggcccacc gacgttccac gatggggcgc tcttaagggc ttagaccctc 120 
gtcgggagta tttctgtgat ctggcgacac tcacgcgaga agtcattacc ggcgatatga 180 
attcactggc cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta 240 
atcgccttgc agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg 3 00 
atcgcccttc ccaacagttg cgcagcctga atggcgaatg ggaaattgta aacgttaata 360 
ttttgttaaa attcgcgtta aatttttgtt aaatcagctc attttttaac caataggccg 420 
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aaatcggcaa aatcccttat aaatcaaaag 
cagtttggaa caagagtcca ctattaaaga 
ccgtctatca gggcgatggc ccactacgtg 
cgaggtgccg taaagcacta aatcggaacc 
ggggaaagcc ggcgaacgtg gcgagaaagg 
gggcgctggc aagtgtagcg gtcacgctgc 
cgccgctaca gggcgcgtca ggtggcactt 
gtttattttt ctaaatacat tcaaatatgt 
tgcttcaata atattgaaaa aggaagagta 
ttcccttttt tgcggcattt tgccttcctg 
taaaagatgc tgaagatcag ttgggtgcac 
gcggtaagat ccttgagagt tttcgccccg 
aagttctgct atgtggcgcg gtattatccc 
gccgcataca ctattctcag aatgacttgg 
ttacggatgg catgacagta agagaattat 
ctgcggccaa cttacttctg acaacgatcg 
acaacatggg ggatcatgta actcgccttg 
taccaaacga cgagcgtgac accacgatgc 
tattaactgg cgaactactt actctagctt 
cggataaagt tgcaggacca cttctgcgct 
ataaatctgg agccggtgag cgtgggtctc 
gtaagccctc ccgtatcgta gttatctaca 
gaaatagaca gatcgctgag ataggtgcct 
aagtttactc atatatactt tagattgatt 
aggtgaagat cctttttgat aatctcatga 
actgagcgtc agaccccgta gaaaagatca 
gcgtaatctg ctgcttgcaa acaaaaaaac 
atcaagagct accaactctt tttccgaagg 
atactgtcct tctagtgtag ccgtagttag 
ctacatacct cgctctgcta atcctgttac 
gtcttaccgg gttggactca agacgatagt 
c 99ggggttc gtgcacacag cccagcttgg 
tacagcgtga gctatgagaa agcgccacgc 
cggtaagcgg cagggtcgga acaggagagc 
ggtatcttta tagtcctgtc gggtttcgcc 
gctcgtcagg ggggcggagc ctatggaaaa 
tggccttttg ctggcctttt gctcacatgt 
ataaccgtat taccgccttt gagtgagctg 
gcagcgagtc agtgagcgag gaagcggaag 
cgcgttggcc gattcattaa tgcagctggc 
gtgagcgcaa cgcaattaat gtgagttagc 
ttatgcttcc ggctcgtatg ttgtgtggaa 
acagct 



aatagaccga gatagggttg agtgttgttc 480 
acgtggactc caacgtcaaa gggcgaaaaa 54 0 
aaccatcacc ctaatcaagt tttttggggt 600 
ctaaagggag cccccgattt agagcttgac 660 
aagggaagaa agcgaaagga gcgggcgcta 720 
gcgtaaccac cacacccgcc gcgcttaatg 78 0 
ttcggggaaa tgtgcgcgga acccctattt 840 
atccgctcat gagacaataa ccctgataaa 900 
tgagtattca acatttccgt gtcgccctta 960 
tttttgctca cccagaaacg ctggtgaaag 102 0 
gagtgggtta catcgaactg gatctcaaca 1080 
aagaacgttt tccaatgatg agcactttta 1140 
gtattgacgc cgggcaagag caactcggtc 12 00 
ttgagtactc accagtcaca gaaaagcatc 1260 
gcagtgctgc cataaccatg agtgataaca 1320 
gaggaccgaa ggagctaacc gcttttttgc 13 80 
atcgttggga accggagctg aatgaagcca 144 0 
ctgtagcaat ggcaacaacg ttgcgcaaac 1500 
cccggcaaca attaatagac tggatggagg 1560 
cggcccttcc ggctggctgg tttattgctg 162 0 
gcggtatcat tgcagcactg gggccagatg 168 0 
gcacggggag tcaggcaact atggatgaac 1740 
cactgattaa gcattggtaa ctgtcagacc 1800 
taaaacttca tttttaattt aaaaggatct 1860 
caaaaatccc ttaacgtgag ttttcgttcc 1920 
aaggatcttc ttgagatcct ttttttctgc 1980 
caccgctacc agcggtggtt tgtttgccgg 2 04 0 
taactggctt cagcagagcg cagataccaa 2100 
gccaccactt caagaactct gtagcaccgc 216 0 
cagtggctgc tgccagtggc gataagtcgt 2220 
taccggataa ggcgcagcgg tcgggctgaa 22 80 
agcgaacgac ctacaccgaa ctgagatacc 2340 
ttcccgaagg gagaaaggcg gacaggtatc 2400 
gcacgaggga gcttccaggg ggaaacgcct 2460 
acctctgact tgagcgtcga tttttgtgat 2520 
acgccagcaa cgcggccttt ttacggttcc 2580 
tctttcctgc gttatcccct gattctgtgg 2640 
ataccgctcg ccgcagccga acgaccgagc 27 00 
agcgcccaat acgcaaaccg cctctccccg 2760 
acgacaggtt tcccgactgg aaagcgggca 282 0 
tcactcatta ggcaccccag gctttacact 2 88 0 
ttgtgagcgg ataacaattt cacacaggaa 2 94 0 

2946 



<210> 5 
<211> 19 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 5 

gttttcccag tcacgacgt 

<210> 6 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 6 

tccggacgtc tcaggctaat gtt 

<210> 7 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 7 

tgtaacgaca cattgctgga tacc 

<210> 8 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 8 

atataactct cgctccttga taac 



<210> 9 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 9 

aggcgtctga ggctgcggct atgg 

<210> 10 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 10 

aacccgtcgc gacgagagtc taag 

<210> 11 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 11 

gatatacgtg atatattttg attg 

<210> 12 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 



<400> 12 

gatcggtatc cagcaatgtg tcgttaca 

<210> 13 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
Oligonucleotide 

<400> 13 

agctgttatc aaggagcgag agttatat 

<210> 14 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
01 igonuc 1 eo t i de 

<400> 14 

catgccatag ccgcagcctc agacgcct 

<210> 15 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
01 igonucleo t ide 

<400> 15 

ctagcttaga ctctcgtcgc gacgggtt 

<210> 16 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



28 



Sequence: Synthetic 



28 



Sequence : Synthetic 



28* 



Sequence: Synthetic 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 16 

ttaacaatca aaatatatca cgtatatc 

<210> 17 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 17 

tgtaacgaca cattgctgga taccgatcc 

<210> 18 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
01 igonuc 1 eo t i de 

<400> 18 

atataactct cgctccttga taacagctt 

<210> 19 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 19 

aggcgtctga ggctgcggct atggcatgg 
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<210> 20 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 20 

aacccgtcgc gacgagagtc taagctagt 2i 

<210> 21 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 21 

gatatacgtg atatattttg attgttaag 2 9 

<210> 22 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 22 

gcgctaggga tccttactgg gacgaagacg aa 32 

<210> 23 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 
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<400> 23 

gcgctaggga tccttactgg gacgaagacg aa 

<210> 24 
<211> 93 
<212> DNA 

<213> Artificial Sequence 



32 



<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 24 

cccgggggat cctcgtttta ttgggccgag ttttggtccg tagtgcttgg ttagatatgc 60 
ttatgttcac aaaatcatcc ttgtacagaa ttc 93 

<210> 25 
<211> 93 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 25 

cccgggggat cctcgtttta ttgggccgag ttttggtccg tagtgcatgg ttagatatgc 60 
ttatgttcac aaaatcatcc ttgtacagaa ttc 93 

<210> 26 
<211> 93 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
01 igonuc 1 eo t ide 

<400> 26 

cccgggtgta cacaaaagtt tacctgaaga acgtgggggg tcgtgcctgg tcttgcgtca 60 
cctggtctca ggagagggtc cccatgggaa ttc 93 

<210> 27 
<211> 18 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 27 

ctactgatcg gatccccg 

<210> 28 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 28 

aaacgacggc cagtgaat 

<210> 29 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 29 

ctactgatcg gatccccg 

<210> 30 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 30 

aaacgacggc cagtgaat 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 31 

ggccgagttt tggtccgtag 

<210> 32 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Oligonucleotide 

<400> 32 

gtcttgcgtc acctggtctc ag 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Application of: M. Reeve, et al. Group Art Unit: To be assigned 

Serial Number: To be assigned Examiner: To be assigned 

Filing Date: August 28, 200 1 

Title: Genetic Analysis 

FIRST PRELIMINARY AMENDMENT 

Honorable Assistant Commissioner of Patents 
Box Patent Application 
Washington, D.C. 20231 

Sir: 

Please consider the following amendments and remarks in connection with the 
prosecution of the captioned application, which is a filing under 35 U.S.C. § 371 and 
claims priority to international application number PCT/GB00/00916 filed MarchlO, 
2000. This application also claims priority to application number 99301933.0 filed in the 
European Patent Office on March 12, 1999. 

In the Claims 

Please amend page 155, line 1 as follows: 

rCLAIMSI 

What is claimed is: 



Please cancel claims 3, 12, 13, 19, 20, and 26, without prejudice. 



Please amend claim 1 as follows: 



1 . (once amended) [A]Ina method of providing a mixture of DN A fragments 
enriched in fragments that are characteristic of a phenotype of interest, [byj which 
method includes providing affected DNA in fragmented form and providing 
unaffected DNA in fragmented form, [which method comprisesjthe improvement 
comprising : 

a) mixing the fragments of the affected DNA and the fragments of the unaffected 
DNA under hybridising conditions to form hybrids ; 

b) recovering a mixture of hybrids that contain mismatches; 

c) recovering fragments of the affected DNA from the mixture of hybrids that 
contain mismatches[; 

and optionally repeating steps a), b) and c) one or more times]. 

Please amend claim 2 as follows: 

2. (once amended) The method of claim 1 wherein the affected DNA is pooled DNA 
of one or more individuals who show the phenotype of interest, and the unaffected 
DNA is pooled DNA of one or more individuals who do not show the phenotype 
of interest. 



Please amend claim 6 as follows: 



6. (once amended) The method of [any one of claims 1 to 5] claim L wherein step b) 
is performed by use of a mismatch-binding protein. 

Please amend claim 7 as follows: 

7. (once amended) The method of [any one of claims 1 to 6] claim 1 , wherein either 
the fragments of the affected DNA or the fragments of the unaffected DNA are 
tagged by one member of a specific binding pair, and step c) is performed by 
using the other member of the specific binding pair. 

Please amend claim 9 as follows: 

9. (once amended) The method of [any one of claims 1 to 8]claimJ_, 

[wherein] further comprising subjecting the mixture of DNA fragments enriched 
in fragments that are characteristic of the phenotype of interest [, is subjected] to 
self-hybridisation [followed by recovery of] to form duplexes and subsequently 
recovering the perfectly matched duplexes. 



Please amend claim 10 as follows: 

10. (once amended) The method of [any one of claims 1 to 9] claim 1 , 
[wherein] further comprising mixing the mixture of DNA fragments enriched in 
fragments that are characteristic of the phenotype of interest^ is mixed] with an 
excess of the fragments of the affected DNA under hybridisation conditions[ ? ]_to 
form duplexes and subsequently recovering the [ followed by recovery of] 
perfectly matched duplexes. 

Please amend claim 1 1 as follows: 

1 1 . (once amended) The method of [any one of claims 1 to 1 0] claim L wherein each 
of the affected DNA and the unaffected DNA is provided in fragmented form by 
digestion with from 4 to 7 six-cutter restriction endonuclease enzymes together 
with from 0 to 50 four-cutter restriction endonuclease enzymes. 

Please amend claim 15 as follows: 

15. (once amended) The method of [claim 13 or ]claim 14, wherein steps a to f) are 
repeated using each different subset of r restriction endonuclease enzymes to give 
(n!)/[(n-r)!r!] different arrays. 



Please amend claim 16 as follows: 



16. (once amended) The method of [any one of claims 13 to 15] claim 14 , wherein the 
n restriction endonuclease enzymes are selected from 4-cutters and 5 -cutters and 
6-cutters. 



Please amend claim 17 as follows: 



17. (once amended) The method of [any one of claims 13 to 16] claim 14 , wherein the 
n is 3 to 10 and r is 2 to 4. 



Please amend claim 21 as follows: 



21 . (once amended) [The] A set of arrays [of claim 19 or claim 20] produced by the 

method of claim 14 , derived from a set of n = 6 six-cutter restriction endonuclease 
enzymes which are BamHI; Bsr GI; Hind III; Ncol; Spel; and AfllL 



Please amend claim 22 as follows: 



22. (once amended) [The] A set of arrays [of claim 19 or claim 20] produced by the 
method of claim 14, derived from the set of n = 6 six-cutter restriction 



endonuclease enzymes which are EcoRI; BspHI; Bglll; Xbal; Acc65I; and ApaLL 



Please amend claim 23 as follows: 

23. (once amended) A nucleic acid characterisation method which comprises 

presenting to [the]a set of arrays [of any one of claims 19 to 22] produced by the 
method of claim 14 a nucleic acid fragment of interest under hybridisation 
conditions, and observing a pattern of hybridisation. 

Please amend claim 28 as follows: 

28. (once amended) The double-stranded DNA molecule of claim 27, wherein[ the 
following criteria are satisfied]: 

a) inter-fragment length differences are greater for larger fragments; 

b) all possible fragments are unambiguously resolvable by electrophoresis from 
one another; 

c) size gaps between bands comprising different numbers of inter-restriction-site 
units are larger than size gaps between bands comprising the same number of 
inter-restriction-site units; 

d) the size gaps and size spread from the largest to the smallest fragment are 
electrophorectically compatible. 



Please add new claim 29 as follows: 

29. (new) The method of claim 1 which further comprises steps a), b) and c) one or 
more times. 

Please add new claim 30 as follows: 

30. (new) The method of claim 14 wherein one of the restriction enzyme specific 
primers is tagged. 

Remarks 

Claims 1-28 are pending in the instant application. Applicants have amended 
claims 1, 2, 6, 7, 9, 10, 11, 15, 16, 17, 21, 22, 23, and 28 to more folly conform with U.S. 
practice and to delete multiple dependencies. Applicants have cancelled, without 
prejudice, claims 3, 12, 13, 19, 20, and 26. Applicants have also added new claims 29 and 
30. A version of the claims marked up to show the amendments, as well as a clean 
version of the claims encompassing the amendments, is attached hereto. 

Applicants respectfully assert that all amendments are fairly based on the 
specification, and respectfully request their entry. 
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Applicants believe that the claims, as amended, are in allowable form, and 
earnestly solicit the allowance of claims 1, 2, 4-11, 14-18, 21-25, and 27-30. 



Amersham Pharmacia Biotech, Inc. 
800 Centennial Avenue 
P. O. Box 1327 

Piscataway, New Jersey 08855-1327 

Tel: (732)457-8423 
Fax: (732) 457-8463 



H:\DEPT\Law_Dept\IP Dept\Amendments\First Preliminary Amendment-PA9912 (8-28-01), doc 



Respectfully submitted, 




Royal N. Ronning, Jr. 32^29 
Attorney for Applicants 
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Claims (marked-up version showing amendments )) 

[CLAIMS] 

What is claimed is: 

1 . (once amended) [ A]Ina method of providing a mixture of DN A fragments 
enriched in fragments that are characteristic of a phenotype of interest, [bvj which 
method includes providing affected DNA in fragmented form and providing 
unaffected DNA in fragmented form, [which method comprises] the improvement 
comprising : 

a) mixing the fragments of the affected DNA and the fragments of the unaffected 
DNA under hybridising conditions to form hybrids ; 

b) recovering a mixture of hybrids that contain mismatches; 

c) recovering fragments of the affected DNA from the mixture of hybrids that 
contain mismatches [; 

and optionally repeating steps a), b) and c) one or more times]. 

2. (once amended) The method of claim 1 wherein the affected DNA is pooled DNA 
of one or more individuals who show the phenotype of interest, and the unaffected 
DNA is pooled DNA of one or more individuals who do not show the phenotype 
of interest. 




6. (once amended) The method of [any one of claims 1 to 5] claim L wherein step b) 
is performed by use of a mismatch-binding protein. 



7. (once amended) The method of [any one of claims 1 to 6] claim L wherein either 
the fragments of the affected DNA or the fragments of the unaffected DNA are 
tagged by one member of a specific binding pair, and step c) is performed by 
using the other member of the specific binding pair. 



9. (once amended) The method of [any one of claims 1 to 8] claim 1 , 

[wherein] further comprising subjecting the mixture of DNA fragments enriched 
in fragments that are characteristic of the phenotype of interest^ is subjected] to 
self-hybridisation [followed by recovery of| to form duplexes and subsequently 
recovering the perfectly matched duplexes. 



10. (once amended) The method of [any one of claims 1 to 91 claim L 

[wherein] further comprising mixing the mixture of DNA fragments enriched in 
fragments that are characteristic of the phenotype of interest^ is mixed] with an 
excess of the fragments of the affected DNA under hybridisation conditions [J_to 
form duplexes and subsequently recovering the [ followed by recovery of] 
perfectly matched duplexes. 



11. (once amended) The method of [any one of claims 1 to 10] claim 1 , wherein each 
of the affected DNA and the unaffected DNA is provided in fragmented form by 
digestion with from 4 to 7 six-cutter restriction endonuclease enzymes together 
with from 0 to 50 four-cutter restriction endonuclease enzymes. 



15. (once amended) The method of [claim 13 or ]claim 14, wherein steps a to f) are 
repeated using each different subset of r restriction endonuclease enzymes to give 
(n!)/[(n-r)!r!] different arrays. 



1 6. (once amended) The method of [any one of claims 1 3 to 1 5] claim 14 , wherein the 
n restriction endonuclease enzymes are selected from 4-cutters and 5 -cutters and 
6-cutters. 



17. (once amended) The method of [any one of claims 13 to 16] claim 14 , wherein the 
n is 3 to 10 and r is 2 to 4. 



2 1 . (once amended) [The] A set of arrays [of claim 1 9 or claim 20] produced by the 

method of claim 14 , derived from a set of n = 6 six-cutter restriction endonuclease 
enzymes which are BamHI; Bsr GI\ Hind III; Ncol; Spel; and AJIIL 



22. 



(once amended) [The] A set of arrays [of claim 19 or claim 20] produced by the 



method of claim 14 , derived from the set of n = 6 six-cutter restriction 
endonuclease enzymes which are EcoRI; BspHI; BgUI; Xbal; Acc65I; and ApaLL 

(once amended) A nucleic acid characterisation method which comprises 
presenting to [the]a set of arrays [of any one of claims 19 to 22] produced by the 
method of claim 14 a nucleic acid fragment of interest under hybridisation 
conditions, and observing a pattern of hybridisation. 

(once amended) The double-stranded DNA molecule of claim 27, wherein[ the 
following criteria are satisfied]: 

a) inter-fragment length differences are greater for larger fragments; 

b) all possible fragments are unambiguously resolvable by electrophoresis from 
one another; 

c) size gaps between bands comprising different numbers of inter-restriction-site 
units are larger than size gaps between bands comprising the same number of 
inter-restriction-site units; 

d) the size gaps and size spread from the largest to the smallest fragment are 
electrophorectically compatible. 

(new) The method of claim 1 which further comprises steps a), b) and c) one or 
more times. 



30. (new) The method of claim 14 wherein one of the restriction enzyme specific 
primers is tagged. 
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Claims (clean version encompassing amendments) 

What is claimed is: 

1 . (once amended) In a method of providing a mixture of DNA fragments enriched 
in fragments that are characteristic of a phenotype of interest, which method 
includes providing affected DNA in fragmented form and providing unaffected 
DNA in fragmented form, the improvement comprising: 

a) mixing the fragments of the affected DNA and the fragments of the unaffected 
DNA under hybridising conditions to form hybrids; 

b) recovering a mixture of hybrids that contain mismatches; 

c) recovering fragments of the affected DNA from the mixture of hybrids that 
contain mismatches. 

2. (once amended) The method of claim 1 wherein the affected DNA is pooled DNA 
of one or more individuals who show the phenotype of interest, and the unaffected 
DNA is pooled DNA of one or more individuals who do not show the phenotype 
of interest. 

4. The method of claim 1, wherein the affected DNA is DNA of one individual who 
shows the phenotype of interest, and the unaffected DNA is pooled DNA of a 



complete set of ancestors who do not show the phenotype of interest. 

5. The method of claim 1 , wherein the affected DNA is DNA from cells of an 
individual that show the phenotype of interest, and the unaffected DNA is DNA 
from cells of the individual that do not show the phenotype of interest. 

6. (once amended) The method of claim 1, wherein step b) is performed by use of a 
mismatch-binding protein. 

: 7. (once amended) The method of claim 1 , wherein either the fragments of the 

I affected DNA or the fragments of the unaffected DNA are tagged by one member 

* of a specific binding pair, and step c) is performed by using the other member of 

1 the specific binding pair. 

I 8. The method of claim 7, wherein the fragments of the unaffected DNA are tagged 

with biotin, and step c) is performed by use of immobilised streptavidin. 

9. (once amended) The method of claim 1, further comprising subjecting the mixture 
of DNA fragments enriched in fragments that are characteristic of the phenotype 
of interest to self-hybridisation to form duplexes and subsequently recovering the 
perfectly matched duplexes. 



(once amended) The method of claim 1, further comprising mixing the mixture of 
DNA fragments enriched in fragments that are characteristic of the phenotype of 
interest with an excess of the fragments of the affected DNA under hybridisation 
conditions to form duplexes and subsequently recovering the perfectly matched 
duplexes, 

(once amended) The method of claim 1 ? wherein each of the affected DNA and 
the unaffected DNA is provided in fragmented form by digestion with from 4 to 7 
six-cutter restriction endonuclease enzymes together with from 0 to 50 four-cutter 
restriction endonuclease enzymes. 

A method of making a set of arrays of fragments of DNA of interest, which 
method comprises; 

a) selecting, from a set of n restriction endonuclease enzymes, a subset of r 
restriction endonuclease enzymes; 

b) digesting genomic DNA with the subset of r enzymes; 

c) ligating to the resulting fragments restriction-enzyme-cutting-site-specific 
adapters with unique polymerase chain reaction amplifiable sequences; 

d) splitting the resulting fragments into r 2 aliquots; 

e) amplifying each aliquot with two restriction-enzyme-specific primers; 

f) forming an array of the r 2 aliquots of non-tagged amplimer strands; and 



g) repeating steps a) to f) using one or more different subsets of r restriction 
endonuclease enzymes. 

(once amended) The method of claim 14, wherein steps a to f) are repeated using 
each different subset of r restriction endonuclease enzymes to give (n!)/[(n-r)!r!] 
different arrays. 

(once amended) The method of claim 14, wherein the n restriction endonuclease 
enzymes are selected from 4-cutters and 5-cutters and 6-cutters. 

(once amended) The method of claim 14, wherein the n is 3 to 10 and r is 2 to 4. 

The method of claim 17, wherein n = 6 and r = 3. 

(once amended) A set of arrays produced by the method of claim 14, derived from 
a set of n = 6 six-cutter restriction endonuclease enzymes which are BamHI; Bsr 
GI; Hind III; Ncol; Spel; and Aflll. 

(once amended) A set of arrays produced by the method of claim 14, derived from 
the set of n = 6 six-cutter restriction endonuclease enzymes which are EcoRI; 
Bspffl; Bglll; Xbal\ Acc65I; and ApaLL 



(once amended) A nucleic acid characterisation method which comprises 
presenting to a set of arrays produced by the method of claim 14 a nucleic acid 
fragment of interest under hybridisation conditions, and observing a pattern of 
hybridisation. 

The method of claim 23, wherein a plurality of nucleic acid fragments of interest 
are separately presented to the set of arrays, and the resulting patterns of 
hybridisation are compared. 

The method of claim 24, wherein the plurality of nucleic acid fragments of 
interest are drawn from the mixture of DNA fragments, enriched in fragments that 
are characteristic of aphenotype of interest, of claim 13. 

A double-stranded DNA molecule having the sequence a-A-b-B...X-y-Y-z where 
A, B . . .X and Y are unique restriction sites for n different restriction endonuclease 
enzymes, and a, b. . .y, z denotes distances in base pairs, characterised in that each 
fragment, obtainable by cutting the DNA molecule by means of any one or more 
up to n of the restriction enzymes, has a different length from every other 
fragment. 

(once amended) The double-stranded DNA molecule of claim 27, wherein: 




a) inter-fragment length differences are greater for larger fragments; 

b) all possible fragments are unambiguously resolvable by electrophoresis from 
one another; 

c) size gaps between bands comprising different numbers of inter-restriction-site 
units are larger than size gaps between bands comprising the same number of 
inter-restriction-site units; 

d) the size gaps and size spread from the largest to the smallest fragment are 
electrophorectically compatible. 

(new) The method of claim 1 which further comprises steps a), b) and c) one or 
more times. 



30. 



(new) The method of claim 14 wherein one of the restriction enzyme specific 
primers is tagged. 
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GENETIC ANALYSIS 



5 INTRODUCTION 



Limitations of the current approaches 

There are a number of limitations to carrying out association 
studies using single nucleotide polymorphisms (SNPs) and linkage 
io disequilibrium within human populations (see Science. Vol 278, p1580, 
(1997) for a review of such methods). We have no control over 
recombination frequency around a given locus or over past human genetic 
crossing. Some mutations will be closely correlated with nearby SNPs and 
others will not. 

15 

The need for whole genome analysis 

With the SNP and linkage disequilibrium approach (and many 
others), markers are essentially used as a surrogate for sequencing - the 
more markers, the better. The logical endpoint of the above argument is to 

20 look at every base in the human genome - and carry out what could be 
termed a whole genome association study. In essence, the sequence at 
every base would be determined for the genome of each member of a 
phenotypicaliy 'affected' and a phenotypically 'unaffected* population. 
Statistical correlations (associations) could then be drawn between 

25 sequence differences and phenotype. Such associations would have 

future predictive values for the phenotype of interest, knowing the genotype 
and could be of great value in medicine and pharmacogenetics. 

The current invention selectively enriches for DNA fragments 
that determine phenotype in the 'affected' population and thus makes the 

30 prospect of carrying out whole genome association studies for humans and 
other species a very real possibility. 
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Definition of terms used with the current invention 

Within the scope of the current invention, the individuals 
chosen for whole genome analysis may be human, animal or plant and 
they may be eukaryotic, prokaryotic or archaebacterial in origin. 
5 The terms 'affected' and 'unaffected' are used without 

limitation r order to categorise individuals into two groups - those that 
possess a defined phenotype of interest ('affected' individuals) and those 
that do not possess the phenotype of interest ('unaffected' individuals). 
The phenotype common to the 'affected' individuals may be either 

10 beneficial (e.g. for these individuals, a particular pharmaceutical entity 
might show high efficacy in a phase il clinical trial) or detrimental (e.g. for 
these individuals, a particular pharmaceutical entity might show adverse 
toxicology in a phase i clinical trial). 

The 'affected' population may comprise one or more 

15 individuals and the 'unaffected' population may similarly comprise one or 
more individuals according to the particular embodiment of the invention 
(see below). 

The term DNA is used throughout for simplicity. Within the 
scope of the current invention, the term DNA may equally well apply to all 

20 or part of the haploid, diploid or polyploid genomic DNA content of one or 
more germ line or somatic cell(s). The DNA may be extracted from ceils 
taken directly from the individual(s), the DNA may be extracted from cells 
cultured or immortalised from the individual(s) or the DNA may be prepared 
from a library of clones - with inserts derived from the individual(s) and 

25 propagated in some appropriate host and cloning vector system. For the 
particular case wherein the term DNA refers to the expressed part of the 
haploid, diploid or polyploid genomic DNA content of one or more somatic 
cells and the DNA is prepared from a library of clones - with inserts derived 
from the individual(s) and propagated in some appropriate host and cloning 

30 vector system, a cDNA library (normalised or otherwise) may be used. 
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In the current invention. DNA is compared in fragmented 
form. Fragmentation can be performed after DNA extraction, prior to 
cloning and/or after cloning. Restriction enzyme digestion is the preferred 
method for such fragmentation - though other methods (e.g. shearing or 
5 sonication) will be obvious to those skilled in the art. 

For the particular case wherein the DNA is prepared from a 
library of clones (either genomic clones or cDNA clones) - with inserts 
derived from the individual(s) and propagated in some appropriate host and 
cloning vector system and wherein restriction enzyme fragmentation is 

10 used prior to cloning, polymerase chain reaction amplification can be used 
to prepare the DNA for comparison in fragmented form. Priming sites 
within the vector sequence flanking the cloned restriction enzyme 
fragmented inserts may be usefully employed for one or more cycles of 
polymerase chain reaction amplification of the fragmented DNA of interest. 

15 The primers used for polymerase chain reaction amplification of the 
fragmented DNA of interest could again be used after the phenotype- 
determining fragment enrichment process to 'rescue' and clone the 
enriched fragments. 

Within the scope of the current invention, the terms 

20 biotinylation and streptavidin capture are used both as an example and as 
the currently preferred embodiment for the invention. The streptavidin may 
be surface attached to inert particles (magnetic or otherwise) or to vessel 
walls (e.g. microtitre plate wells). The biotin may be introduced via a 
deoxynucleotide triphosphate analogue using a polymerase; by using a 

25 biotin-conjugated primer and polymerase chain reaction amplification; 
chemically or photochemically. The use of biotin and streptavidin is not a 
limitation for the invention. The invention could equally be used with other 
high affinity capture systems well known to those skilled in the art (e.g. l his 
tag' introduction and metal ion affinity capture). 

30 Within the scope of the current invention, the term 'abnormal' 

- used with respect to the term 'normal' - is used without limitation in order 
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to denote a somatic ceil (or somatic cells) with a discernable phenotypic 
characteristic (or characteristics) arising from the acquisition of a different 
somatic mutation (or set of somatic mutations) from that (or those) seen in 
the 'normal' counterpart. Cells will most usually be considered 'abnormal' 

5 with respect to their 'norma! 1 counterparts through the acquisition of a 
different somatic mutation (or set of somatic mutations) leading to one or 
more of the following phenotypic characteristics: altered marker gene 
expression, altered genomic organisation, growth under certain selective 
culture conditions, immortalised growth in culture, unrestrained growth in 

io vivo or in vitro, failure of normal apoptotic control mechanisms in vivo or in 
vitro, induction of neovascularisation. escape of cells across epithelium, 
migratory cell survival or metastasis. 

Within the scope of the current invention, the term mismatch 
recognition protein is used without limitation to denote a protein of 

is eukaryotic, prokaryotic or archaebacterial origin capable of the selective 
recognition of (and binding to) a DNA duplex that is not perfectly matched 
along its entire length. Recognition of (and binding to) will be preferably for 
bases that are not engaged in correct Watson and Crick pairing and for 
small deletions or insertions. Many such proteins are known to those 

:o skilled in the art. Prokaryotic and eukaryotic mutS homologues, phage T4 
endonuciease VII, phage T7 endonuclease i and the plant enzyme CEL-1 
are just some examples. 

inter-population perfectly matched duplex depletion 

25 In the inter-population perfectly matched duplex depletion 

approach, we compare (in fragmented form) the pooled DNA of 'affected* 
individuals with the pooled DNA of 'unaffected' individuals (both from 
populations as outbred and otherwise similar to each other as possible). 
We are only interested in those regions where differences occur between 

30 'affected' and 'unaffected' DNA molecules. For populations as above, the 
only prevalent sequence differences within the 'affected 1 pool (compared to 
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the 'unaffected 1 pool) should be somewhere within the gene{s) (using the 
term gene in its widest sense to include exons. introns and all associated 
upstream and downstream regulatory sequences) that actually 
determine(s) their common phenotype. This means that we are no longer 
5 tied into working with rare (and perhaps atypical) populations where there 
is high genetic homogeneity. 

Pooling the DNA from entire phenotypicaily-defined 
populations massively reduces the amount of labour involved. 

io DNA sequence variation in populations 

Nickerson ef at, Nature Genetics, Vol 19, p233 (1998), sequenced 9.7 kb of 
the lipoprotein lipase gene from 71 individuals (24 African-Americans, 24 
Europeans and 23 European-Americans). This gene is fairly typical (90 % 
intron and 10 % exon - total size 30 kb with 10 exons). 

15 88 sequence variants were found (i.e. one per 1 1 0 bp on 

average). Most variations were found in non-coding sequence. 90 % of 
these were SNPs (60 % of which were transitions and 40 % were 
transversions). All of the SNPs were biailelic. 10 % of the sequence 
variants were insertions or deletions at repeat sequences. 

20 58 % of the sequence variants were present in all three ethnic 

populations. Half of these were found in heterozygous form and half in 
homozygous form. 

Nucleotide diversity (defined as the expected number of 
nucleotide differences per site between a random pair of chromosomes 

25 drawn from the population) is 1/500 for DNA in genera! and 1/2,000 for 
coding sequence DNA. This means that, on average, any two DNA 
fragments annealed together from such a population will contain a 
mismatch every 500 bases. 

DNA sequence variants are therefore very common. They 

30 are not, however, totally random - the variants that occur every 500 bases 
or so are limited; they are generally biailelic at just that single base. It is 
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this fact that the inter-population perfectly matched duplex depletion 
approach selectively exploits. 

Statistical analysis of the inter-population perfectly matched duplex 
5 depletion fragmentation process 

If the length of the DNA fragments is F bases and the 
average iength between sequence differences between any two DNA 
molecules is 500 bases, the probability that a hybrid duplex between any 
two random DNA fragments will contain no mismatches is given by 

10 

Pr(0) = e (TOOO) 

and the probability that a hybrid duplex between any two random DNA 
molecules will contain any mismatches is given by 

15 

Pr(>1) = {1-(e (F/500} )} 

Example values for Pr(0) and Pr(>1 ) for different values of F 
are given below 



F 


Pr(0) 


Pr{>1) 


10 


0.98 


0.02 


20 


0.96 


0.04 


50 


0.90 


0.10 


100 


0.82 


0.18 


200 


0.67 


0.33 


300 


0.55 


0,45 


400 


0.45 


0.55 


500 


0.37 


0.63 


600 


0.30 


0.70 


700 


0.25 


0.75 


800 


0.20 


0.80 


900 


0.17 


0,83 


1000 


0.14 


0.86 
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Example average restriction fragment sizes for DNA digestion with six 
6 bp cutters and up to four 4 bp cutters 

Each 6 bp cutter will cut DNA every 4.096 bp on average and 
each 4 bp cutter will cut DNA every 256 bp on average. 
5 For a given set of A 6 bp cutters and B 4 bp cutters (with no 

duplication of restriction enzyme cutting sequence), the average fragment 
length F will be 

{(A/4.096)+(B/256)}- 1 

io 

Example values for F as A and B are varied are given in the 
following table 



A 


B 


average size 
(F) 


6 


0 


683 


6 


1 


186 


6 


2 


108 


6 


3 


76 


6 


4 


59 



15 We should note that the above situation assumes that none of 

the 4 bp cutter recognition sites lie within any of the 6 bp cutter recognition 
sites. If, for example, we have a 4 bp cutter recognition site nested within a 
6 bp cutter recognition site (e.g. from the use of Mbol and BamHI'm the 
fragmentation), then we should reduce the value of A from 6 to 5 in the 

20 above. 

In genera!, if we have a given set of A enzymes that cut DNA 
every a bp on average, B enzymes that cut DNA every b bp on average, 
... , Z enzymes that cut DNA every z bp on average (with no duplication of 
restriction enzyme cutting sequence), the average fragment length F will be 

25 

{(A/aWB/b)+... + (Z/z)}* 1 
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Example sets of 6 bp cutters and 4 bp cutters for fragmentation 

Example sets of 6 bp cutters and 4 bp cutters that contain 
panels of six 6 bp cutters that are compatible with terminal restriction site 
profiling array (TRSPA) analysis (see below) are given in the following: 

Example enzyme set 1 

For restriction in M buffer + BSA 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 

■c 


U/\i\ 


Supplier 


inactivate 
°C/min 


1 


Mbol 


GATC 


100 


37 


25 


NEB 


65/20 


2 


Haelff 


GGCC 


100 


37 


50 


NEB 


80/20 


3 


Msel 


TTAA 


100 


37 


20 


NEB 


65/20 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 

°C 


U/*il 


Supplier 


Inactivate 
°C/min 


1 


BamHI 


GGATCC 


100 


37 


100 


NEB 


80/20 


2 


BsrGI 


TGTACA 


100 


37 


10 


NEB 


80/20 


3 


Hmdlli 


AAGCTT 


100 


37 


>40 


APB 


65/20 


4 


Ncol 


CCATGG 


100 


37 


50 


NEB 


65/20 


5 


Spef 


ACTAGT 


100 


37 


50 


NEB 


65/20 


6 


Affll 


CTTAAG 


100 


37 


10 


APB 


60/15 



10 

Example enzyme set 2 

For restriction in M buffer + BSA 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 

°C 




Supplier 


inactivate 
°C/min 


1 


Mbol 


GATC 


100 


37 


25 


NEB 


65/20 


2 


Haelli 


GGCC 


100 


37 


50 


NEB 


80/20 


3 


Msel 


TTAA 


100 


37 


20 


NEB 


65/20 
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Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 


U/ui 


Supplier 


Inactivate 
~C/min 


1 


EcoRI 


GAATTC 


100 


37 


>40 


APB 


65/20 


2 


BspHl 


TCATGA 


100 


37 


10 


APB 


65/20 


3 


Bglll 


AGATCT 


75 


37 


>40 


APB 


No 


4 


Xbal 


TCTAGA 


100 


37 


100 


NEB 


65/20 


5 


Acc65l 


GGTACC 


75 


37 


10 


NEB 


65/20 


6 


ApaLi 


GTGCAC 


100 


37 


10 


NEB 


No 



Aspect (I) 

In one aspect the invention provides a method of providing a 
mixture of DNA fragments enriched in fragments that are characteristic of a 
5 phenotype of interest, by providing affected DNA in fragmented form and 
providing unaffected DNA in fragmented form f which method comprises: 

a) mixing the fragments of the affected DNA and the fragments 
of the unaffected DNA under hybridising conditions; 

b) recovering a mixture of hybrids that contain mismatches; 

io c ) recovering fragments of the affected DNA from the mixture of 

hybrids that contain mismatches; 

and optionally repeating steps a), b) and c) one or more 

times. 

is Inter-population mismatch-containing duplex selection 

'Affected' versus 'unaffected' (i.e. inter-population) mismatch- 
containing duplex selection can be achieved by attaching a mismatch- 
binding protein to a solid support (or using the mismatch-binding protein in 
solution followed by subsequent solid-phase capture), taking fragmented 
20 and denatured 'affected' DNA and hybridising this to an excess of 
fragmented, denatured and biotinyiated 'unaffected' DNA with ensuing 
capture of mismatch-containing duplex molecules. Releasing the 
mismatch-containing duplex molecules without denaturation, streptavidin 
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capture and then release of the non-biotinylated strands will give only the 
desired species as shown below. 

Method 

Fragment the 'affected 5 DNA. Fragment and derivatise the 
'unaffected' DNA with biotin. Only DNA from this population will be 
streptavidin captured. Melt and anneal to give 



A 



V 



biotin A biotin 



biotin Biotin 



biotin * biotin 



10 Mismatch-binding protein select. Capture only the mismatch- 

containing duplexes. Release without denaturation to give 



A biotin 

biotin v 



V 



biotin 
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Streptavidin capture to give 

a biotin 

biotm v 

A biotin 

v 

Release the non-biotinyiated strands to give 

5 

a biotin 

v *~ ' 

v ~ — 

Repeat as necessary. 

Repetition of the above sequence of reactions will lead to 
inter-population perfectly matched duplex depletion. 

io 

What will be purified ? 

Inter-poputation mismatch-containing duplex selection as 
above ensures that all of the various phenotype-determining fragments 
(unique to the 'affected 1 population) are captured for subsequent analysis - 
15 but it also causes the co-purification of very many SNP-containing {'noise') 
fragments. 

We now need to consider the fate of the various types of 
fragment (i.e. those that determine the phenotype and those that do not) as 
we carry out inter-population perfectly matched duplex depletion cycles as 
20 above. 
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Recovery of 'affected' DNA moiecuies after streptavidin capture 

For a particular fragment, if we have X moiecuies of 'affected' 
DNA and Y moiecuies of 'unaffected 5 biotinylated DNA, after complete 
hybridisation, there will be a ratio of {Y/(X+Y)} streptavidin-capturabie 
5 moiecuies to {X/(X+Y)} streptavidin-non-capturabie moiecuies. We can 
thus manipulate the yield of streptavidin-capturabie hybrids by varying X 
and Y. 

Example recovery and loss figures for various X and Y are 
shown in the following table 

10 



ratio 
(Y/X) 


recovery 
= (Y/{X+Y)} 


Loss 
= {X/(X+Y}} 


1 


50% 


50% 


2 


67% 


33% 


3 


75% 


25% 


4 


80% 


20% 


9 


90% 


10% 


19 


95% 


5% 


99 


99% 


1 % 


999 


99.9 % 


0.1 % 



After n cycles, the recovery for a phenotype-determining 
fragment will be given by {Y/(X+Y)} n 

15 Loss of general SNP-containing {'noise') fragments during inter- 
population perfectly matched duplex depletion cycles 

If we anneal fragmented DNA molecules together and capture 
only the mismatch-containing duplexes, then n repetitions of such a 
process will reduce the original number of fragments to the following 
20 fraction 



{1 .(e ( Fy500 >)} n .{Y/(X+Y)} n 



WO 00/55364 



PCT/GB00/00916 



- 13- 

The enrichment for phenotype-determining fragments over 
SNP-containing ('noise') fragments during inter-population perfectly 
matched duplex depletion cycles will therefore be given by 

{1 _ (e -(F/500) )r n 

Example figures for enrichment are given below with F = 50, 
100, 200. 300, 400, 500, 600, 700, 800, 900 and 1,000 and n = 1, 2, 3, 4, 5, 
6 ,7, 8, 9 and 10. 





n = 


F 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


50 


11 


110 


1,160 


12,194 


128,135 


1.346,489 


1.E+07 


1.E+08 


2.E+09 


2.E+10 


100 


6 


30 


168 


926 


5,110 


28,187 


155,500 


857,841 


4.732,415 


3.E+07 


200 


3 


9 


28 


85 


257 


779 


2.362 


7.166 


21,735 


65,929 


300 


2 


5 


11 


24 


53 


119 


263 


582 


1,291 


2,860 


400 


2 


3 


6 


11 


20 


36 


65 


118 


215 


390 


500 


2 


3 


4 


6 


10 


16 


25 


39 


62 


98 


600 




2 


3 


4 


6 


9 


12 


18 


25 


36 


700 




2 


2 


3 


4 


5 


7 


10 


13 


17 


800 




2 


2 


2 


3 


4 


5 


6 


8 


10 


900 




1 


2 


2 


2 


3 


4 


4 


5 


6 


1,000 




1 


2 


2 


2 


2 


3 


3 


4 


4 



Loss of specific SNP-containing ('noise') fragments during inter- 
population perfectly matched duplex depletion cycles 

The non-polymorphic and SNP-containing ('noise 1 ) fragments 
15 will be depleted as described above. 

Not all fragments will, however, be depleted at the same rate. 
An individual SNP-containing ('noise') fragment will be depleted with every 
cycle of inter-population perfectly matched duplex depletion as outlined 
below. 

20 Let us assume that there are two alleles for the polymorphism 

within a particular fragment. Let these be called P and Q. Let p be the 
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fraction of the P allele in the (outbred) 'affected' and 'unaffected 1 
populations and let q be the fraction of the Q allele. Let (p+q)=1 , so that 

q=(1-p). 

After denaturation and annealing, the four possible events are 
5 PP, PQ. QP and QQ. PP and QQ will form perfectly matched duplexes and 
will therefore be lost - whereas PQ and QP will form mismatch-containing 
duplexes and will consequently be recovered. 

After one cycle, the fraction of recovered molecules will be 



10 (2pq/(p 2 +2pq+q 2 )} 

= {2pq/((p+q) 2 )} 
= 2pq 

15 

= 2p(1-p) 



Hence if we start out with M molecules of DNA from the 
population, there will be 2Mpq molecules remaining after the first round of 
20 inter-population mismatch-containing duplex selection. Let us denote the 
number of molecules entering the second round of inter-population 
mismatch-containing duplex selection by M\ where 



M' = 2Mpq 

25 

and the fraction of lost molecules will be 



{(p 2 +q 2 )/(p 2 +2pq+q 2 )} 

30 ={(p 2 +q 2 )/((p+q) 2 )} 
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2 2 

= P +q 

= p 2 + {1-2 P+ p 2 } 
5 = l-2p+2p 2 

= 1-2p(1-p) 
=1-2pq 

io 

The fractional loss of P-alielic molecules will be p 2 and the 
fractional loss of Q-allelic molecules will be q 2 ( = (1-p) 2 ). 

If we start out with M molecules of DNA from the population, 
there will be Mp of the P-alleiic molecules and Mq = M(1-p) of the Q-allelic 
15 molecules before inter-population mismatch-containing duplex selection. 
After inter-population mismatch-containing duplex selection, there will 
therefore be Mp-Mp 2 = Mp(1-p) = Mpq of the P-allelic molecules and Mq- 
Mq 2 = Mq(1-q) = Mqp of the Q-allelic molecules, in other words, after the 
first round of mismatch-containing duplex selection, there will be the same 
20 number of P-allelic molecules as Q-allelic molecules. 

We can define new allelic frequencies p' and q f as follows 

p' = q' = 0.5 

25 If we now perform a second round of inter-population 

mismatch-containing duplex selection, we start out with M' molecules of 
DNA from the first round. There will be M'p' of the P-ailelic molecules and 
M'q 1 of the Q-allelic molecules before inter-population mismatch-containing 
duplex selection. 

30 After inter-population mismatch-containing duplex selection, 

there will therefore be M'p'-Mpp' = M'p'(1-p) = M'p'q of the P-allelic 
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molecules and M q'-Mqq' = M'q : (1-q) = M'q'p of the Q-altelic molecules. 
The total number of molecules (M") wiii be 

M'pq + M'q'p 

5 

= M'(p'q-q'p) 

but p' = q 3 = 0.5 
io hence M" = 0.5*M'(p+q) 

but (p+q) = 1 
so M" = (MV2) 

15 

In other words, after the second round of inter-population 
mismatch-containing duplex selection, there will again be the same number 
of P-allelic molecules as Q-allelic molecules. Thus the new allelic 
frequencies p" and q" remain as previously p" = q" = 0.5. 

20 A pattern now emerges. After the first cycle of inter- 

population mismatch-containing duplex selection, the allelic frequencies are 
equalled and the number of molecules is reduced to 2Mpq (Mpq of each 
allelic molecule). Thereafter, every cycle halves the number of molecules 
and keeps both alleles at the same frequency. 

25 Consequently, after n cycles, the number of P-allelic 

molecules will be 

Mpq(0.5) n - 1 

30 and the number of Q allelic molecules will be 
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Mqp(0.5) n_1 



both are. of course, equal. 

If we now take the capture yield (see above) into 
5 consideration, the SNP-containing ('noise') fragment yield will be given by 



2Mpq(0.5) n " 1 -{Y/(X+Y)} n 



where both allelic variants are deemed to be captured 'noise'. 

io 

Polymorphisms that interfere with the pattern of restriction digestion 

For both the loss of general and specific SNP-containing 
('noise') fragments during inter-population perfectly matched duplex 
depletion cycles (described above) and where the SNP interferes with the 

15 pattern of restriction digestion, if the mismatch-binding protein also binds to 
duplex molecules with unequal lengths (e.g. from inter-population 
annealing around a site of restriction site polymorphism), then the above 
analysis still holds (with perfectly matched duplex being replaced by equal 
length duplex and mismatch-containing duplex being replaced by unequal 

:o partner-length duplex). 

In the rare cases where a restriction site is lost due to a 
sequence change that actually determines the phenotype of interest, a 
double-length fragment will be obtained. This will give rise to a double 
terminal restriction site profiling array (TRSPA) signature (see below). 

25 Multiple isolates of the particular double signature will be indicative of an 
association between the fragment and the phenotype of interest. 
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Further 'kinetic 1 enrichment to enhance the selective removal of SNP- 
containing noise' from the pool of phenotype determining fragments 

After multiple cycles of enrichment by the above procedure, 
the enriched DNA pool should contain many copies of all phenotype- 

5 determining fragments but also iow numbers of copies of many different 
phenotype non-determining fragments. The total number of 'noise' 
fragments may exceed the number of phenotype determining fragments, 
despite the number of each individual 'noise 1 species being very small. The 
'noise' fragments would therefore increase the number of probes required 

io for TRSPA analysis before a pattern emerges. To largely eliminate this 
problem, a further kinetic enrichment procedure is used. Either one or both 
of strategies A and B below can be employed to achieve 'kinetic 1 
enrichment. 

15 Strategy A - Subtraction of the enriched DNA from inter-population 
mismatch containing duplex depletion 

The enriched fragment pool from inter-population mismatch 
containing duplex depletion is rapidly self-hybridised - enabling the 
common phenotype-determining fragments to form perfectly matched 
20 duplexes with greater efficiency than the rare 'noise' fragments. Selection 
for perfectly matched duplexes then yields a selectively further enriched 
pool of fragments. Multiple cycles of subtraction could be carried out if 
necessary, 

25 Strategy B - Hybridisation of the enriched DNA from inter-popuiation 
mismatch containing duplex depletion against the 'affected' DNA pool 

The enriched fragment pool from inter-popuiation mismatch 
containing duplex depletion is then hybridised to an excess of biotinyiated 
DNA from the 'affected' pool. This allows the common phenotype- 
30 determining fragments to form perfectly matched duplexes with greater 
efficiency than the rare 'noise 1 fragments. Selection for perfectly matched 
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duplexes followed by streptavidin capture and denaturation to release the 
non-biotinyiated strands then yields a further enriched pool of fragments. 
Multiple such 'affected' pool back-hybridisations could be carried out if 
necessary. 

5 

Extension of the invention to the case of single phenotypically 
'affected' individuals within populations where the distinction 
between 'affected 5 and 'unaffected' is clear 

The above has described inter-population perfectly matched 
io duplex depletion between non-biotinylated DNA fragments from an 

'affected' population and biotinylated DNA fragments from an 'unaffected' 
population. Provided the 'unaffected' population is sufficiently complex that 
it contains all the non-phenotype-determining sequence variants found in a 
single Effected 1 individual, then inter-population perfectly matched duplex 
15 depletion should be possible for single phenotypically 'affected' individuals 
against an 'unaffected 1 population where the distinction between 'affected 1 
and 'unaffected 5 is clear. The latter proviso is needed in order to ensure 
that a small number of misdiagnosed 'affected 1 individuals in the 
'unaffected' population do not cause the removal of phenotype-determining 
20 fragments during inter-population perfectly matched duplex molecular 
depletion. 

Extension of the invention to the case of disease gene identification 
in cases where novel phenotype-determining mutations arise 
25 spontaneously within a family 

Except for a small number of sequence changes, each of us 
contains DNA sequence derived from our parents - our individuality 
resulting from precisely which parental alleles we receive. If one of the 
above small number of sequence changes results in a change in 
30 phenotype, then we can use inter-population perfectly matched duplex 
depletion to enrich for fragments encoding this change in phenotype. 
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If we take 'unaffected total ancestral' cells (by which we mean 
cells derived from a complete set of 'unaffected' ancestors - e.g. both 
parents, or mother plus two paternal grandparents, or father plus two 
maternal grandparents, or two maternal grandparents and two paternal 

5 grandparents etc. ) as the source of our biotinylated fragments and cells 
from an 'affected' descendent as the source of our non-biotinylated 
fragments, any fragments that have acquired phenotype-determining 
sequence changes between 'unaffected 1 ancestral generations and the 
'affected 1 descendent generation will be unable to form perfectly matched 

io duplexes with the biotinylated 'unaffected total ancestral 1 fragments. 
Successive cycles of such inter-popuiation perfectly matched duplex 
depletion will thus lead to the enrichment of fragments carrying all such 
sequence - the degree of enrichment per cycle being as described below. 

15 Statistical analysis 

Let us assume that equal numbers of fragments are used 
from each of the 'unaffected' ancestors. Let the number of such ancestors 
be A. 

{1/A} of the annealings in the inter-popuiation perfectly 
20 matched duplex depletion will be seif-against-ancestral-transmitted alleles - 
statistically equivalent to self-against-self inter-population perfectly matched 
duplex depletion (see below). 

{(A-1)/A} of the annealings in the inter-population perfectly 
matched duplex depletion will be self-against-ancestrai-nontransmitted 
25 alleles - statistically equivalent to inter-population perfectly matched duplex 
depletion between unrelated individuals. 



A 


{1/A} 


{<A-1)/A} 


2 


1/2 


V2 


3 


1/3 


2/3 


4 


1/4 


3 /4 
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From the data of Nickerson et al a DNA sequence variation 
between unrelated individuals should occur every 500 base pairs (it is this 
figure that we use for inter-population perfectly matched duplex depletion 
with seif-against-ancestral-nontransmitted alleles and inter-population 

5 perfectly matched duplex depletion between unrelated individuals). In 
addition, a given individual should be heterozygous once every 573 base 
pairs (this figure is used for inter-population perfectly matched duplex 
depletion with self-against-ancestral-transmitted alleles and self-against- 
self inter-population perfectly matched duplex molecular depletion). Inter- 

io population perfectly matched duplex depletion against the transmitted 
alleles and the nontransmitted alleles will now be considered separately. 

Self-against-ancestral-transmitted alleles inter-population perfectly 
matched duplex depletion 

is !f we anneal the two complementary strands for a fragment 

using DNA from an 'affected' descendent and DNA containing the 
transmitted alleles of 'unaffected' ancestors, then {1-e ( ~ F/573) } of fragments 
will contain one or more site of heterozygosity. {1/A} of the annealings will 
be of this type. For such annealings, where a site of heterozygosity is 

:o present, the probability of obtaining a mismatch-containing duplex between 
a biotinylated fragment and a non-biotinylated fragment containing the site 
of heterozygosity is 0.5. 

Self-against-ancestral-nontransmitted alleles inter-population 
25 perfectly matched duplex depletion 

If we anneal the two complementary strands for a fragment 
using DNA from an 'affected 1 descendent and DNA containing the 
nontransmitted alleles of 'unaffected' ancestors, then {1-e ( ~ F/500) } of 
fragments will contain one or more site of DNA sequence variation. {(A- 
30 1 )/A} annealings will be of this type. 
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Phenotype-determining fragment enrichment 

The fraction of fragments carried through the first cycle of 
inter-population perfectly matched duplex depletion will therefore be 

<[0.5-{1/AH1-e™ 

where {Y/(X+Y)} represents the yield of streptavidin- 
capturable molecules. 

Hence n repetitions of such a process will reduce the original 
in number of fragments to the following fraction 

([0.5-{1 /AH1 -e ( ' F/573) }M{(A-1 )/A}.{1 -e ( - F/500} }3) n ^{Y/(X + Y)} n 

The enrichment for phenotype-determining fragments over 
is SNP-containing ('noise 1 ) fragments during inter-population perfectly 
matched duplex depletion cycles will therefore be given by 

([0.5-{1/AH1-e ( - F ^ 

2o Example figures for enrichment are given for A = 2, 3 and 4 

below with F = 50, 100, 200, 300, 400, 500, 600, 700, 800, 900 and 1 ,000 
and n = 1, 2, 3, 4 , 5 ,6 ,7, 8, 9 and 10. 
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For A=2 





n= 




F 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


50 


15 


213 


3,115 


45,494 


664.427 


9703,694 


I.E-t-08 


2.E+09 


3.E+10 


4.E+11 


100 


8 


59 


448 


3.430 


26.250 


200.887 


1.537.367 


1.E+07 


9.E+07 


7.E+08 


200 


4 


18 


74 


309 


1.296 


5.434 


22.782 


95.524 


400.520 


1,679,340 


300 


3 


9 


28 


87 


265 


811 


2,475 


7,558 


23,078 


70,468 


400 


2 


6 


16 


39 


97 


241 


600 


1,497 


3,734 


9,313 


500 


2 


5 


10 


22 


48 


103 


224 


485 


1,051 


2,277 


600 


2 


4 


7 


15 


29 


56 


109 


213 


416 


813 


700 


2 


3 


6 


11 


19 


35 


63 


114 


207 


374 


800 


2 


3 


5 


8 


14 


24 


42 


71 


121 


205 


900 


2 


3 


4 


7 


11 


18 


30 


49 


79 


128 


1,000 


2 


2 


4 


6 


9 


15 


23 


36 


57 


89 



For A=3 

5 





n= 




F 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


50 


13 


167 


2.154 


27.823 


359,340 


4,640,941 


6.E+07 


8.E+08 


1.E+10 


1.E+11 


100 


7 


46 


311 


2.104 


14,251 


96,516 


653.682 


4.427,241 


3.E+07 


2.E+08 


200 


4 


14 


51 


191 


709 


2,633 


9,786 


36,365 


135,135 


502,173 


300 


3 


7 


20 


54 


146 


396 


1.073 


2,909 


7.882 


21,361 


400 


2 


5 


11 


24 


53 


119 


263 


582 


1.290 


2,859 


500 


2 


4 


7 


14 


27 


51 


99 


190 


367 


707 


600 


2 


3 


5 


9 


16 


28 


48 


84 


147 


255 


700 


2 


3 


4 


7 


11 


18 


28 


46 


74 


119 


800 


2 


2 


4 


5 


8 


12 


19 


28 


43 


66 


900 


1 


2 


3 


4 


6 


9 


14 


20 


29 


41 


1,000 


1 


2 


3 


4 


5 


8 


11 


15 


21 


29 
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For A=4 





n= 




F 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


50 


12 


149 


1,826 


22.316 


272,759 


3.333,771 


4.E+07 


5.E+08 


6.E+09 


7.E+10 


100 


6 


41 


264 


1.690 


10.834 


69,465 


445.373 


2.855.512 


2.E+07 


1.E+08 


200 


4 


12 


44 


154 


540 


1.902 


6.696 


23,570 


82,967 


292,045 


300 


3 


7 


17 


44 


112 


287 


737 


1.894 


4,864 


12,494 


400 


2 


4 


9 


20 


41 


86 


181 


381 


800 


1,681 


500 


2 


3 


6 


11 


20 


37 


68 


125 


229 


418 


600 


2 


3 


5 


7 


12 


20 


34 


56 


92 


152 


700 


2 


2 


4 


5 


8 


13 


20 


30 


46 


71 


800 


1 


2 


3 


4 


6 


9 


13 


19 


27 


39 


900 


1 


2 


3 


4 


5 


7 


9 


13 


18 


25 


1.000 


1 


2 


2 


3 


4 


6 


7 


10 


13 


17 



Extension of the invention to the case of fully comprehensive 
5 'abnormal' cell mutational profiling within an individual 

if we now take 'normal 1 cells as the source of our biotinyiated 
fragments and 'abnormal' ceils from the same individual as the source of 
our non-biotinylated fragments, any fragments that have acquired 
sequence changes on the way to becoming 'abnormal' will be unable to 

lo form perfectly matched duplexes with the biotinyiated 'normal' fragments. 
Successive cycles of such inter-population perfectly matched duplex 
depletion will thus lead to the enrichment of fragments carrying all those 
sequence differences between the 'normal' cells and the 'abnormal' cells - 
the degree of enrichment per cycle being as described below. 

i5 From the data of Nickerson et a/, a given individual should be 

heterozygous about once every 573 base pairs. 

If we anneal the two complementary strands for a fragment 
using DNA from 'abnormal' cells and DNA from 'normal' cells, then 
{1-e<~ F/573) } of fragments will contain one or more site of heterozygosity. 

20 For a heterozygous site, p and q are both 0.5. The probability 

of obtaining a perfectly matched duplex between a biotinyiated fragment 
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10 



20 



and a non-biotinylated fragment containing the site is 0.5. Similarly, the 
probability of obtaining a mismatch-containing duplex between a 
biotinylated fragment and a non-biotinylated fragment containing the site is 
also 0.5. 

The fraction of fragments carried through the first cycle of 
inter-population perfectly matched duplex depletion will therefore be 

0.5.{1-e ( - F/573) HY/(X+Y)} 

hence n repetitions of such a process will reduce the original 
number of fragments to the following fraction 



[0.5.{1-e<- F/573) }] n -{Y/(X+Y)} n 

1 5 The enrichment for phenotype-determining fragments over 

SNP-containing ('noise') fragments during inter-population perfectly 
matched duplex depletion cycles will therefore be given by 



[0.5.{1-e ( - R573) }T n 

Example figures for enrichment are given below with F = 50, 
100, 200, 300, 400, 500, 600, 700, 800, 900 and 1,000 and n = 1, 2, 3, 4, 5, 
6, 7, 8, 9 and 10. 
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n= 




F 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


50 


24 


573 


13.711 


328,171 


7.854.627 


2.E+08 


4.E+09 


1.E~11 


3.E+12 


6.E+13 


100 


12 


156 


1.948 


24.329 


303,844 


3.794,731 


5.E+07 


6.E+08 


7.E+09 


9.E+10 


200 


7 


46 


313 


2.123 


14.412 


97,830 


664.076 


4.507.784 


3.E+07 


2.E+08 


300 


5 


24 


118 


580 


2.845 


13,958 


68.490 


336.072 


1,649,059 


8,091,708 


400 


4 


16 


63 


251 


999 


3,977 


15,831 


63.012 


250.815 


998,346 


500 


3 


12 


41 


139 


479 


1.644 


5,650 


19,411 


66,688 


229,114 


600 


3 


9 


29 


90 


278 


B56 


2,638 


8,128 


25,047 


77,179 


700 


3 


8 


23 


65 


183 


520 


1.475 


4,183 


11,862 


33,639 


800 


3 


7 


19 


50 


133 


353 


937 


2,491 


6,621 


17,599 


900 


3 


6 


16 


41 


103 


259 


654 


1.652 


4.171 


10,532 


1,000 


2 


6 


14 


34 


84 


202 


490 


1,188 


2,880 


6,978 



This approach will enrich for fragments containing all 
sequence differences within the 'abnormal' cells - no prior knowledge of the 
5 genes (oncogenes, tumour suppressor genes etc. ) that may need to be 
investigated is required. 

Having thus isolated a fragment (or fragments) determining 
differences between the 'affected 1 and 'unaffected' populations, we can 
now proceed to their analysis. 

10 

Aspect (II) 

In another aspect the invention provides a method of making 
a set of arrays of fragments of DNA of interest, which method comprises: 



a) selecting, from a set of n restriction endonuclease enzymes, a 
is subset of r restriction endonuclease enzymes; 

b) digesting genomic DNA with the subset of r enzymes; 

c) ligating to the resulting fragments restriction-enzyme-cutting- 
site-specific adapters with unique polymerase chain reaction amplifiable 
sequences; 

20 d) splitting the resulting fragments into r 2 aliquots; 

e) amplifying each aliquot with two-restriction enzyme-specific 
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p rimers; 

f) forming an array of the r 2 aliquots of the ampiimers; 
and 

g) repeating steps a) to f) using a different subset of r restriction 
5 endonuclease enzymes. The invention also includes sets of arrays 

obtained or obtainable by the method. 

The n restriction endonuclease enzymes may be selected 
from 4-cutters and 5-cutters and 6-cutters, and a set may include enzymes 
from one or two or three of these categories. The value of n is preferably 3 
io to 10. for reasons discussed beiow. The value of r is less than n and is 
preferably 2 to 4, chosen with reference to the frequency with which the 
chosen enzymes cut nucleic acids, and ease of fragment amplification by 
PCR. 

15 Terminal restriction site profiling arrays (TRSPAs) 

If we use a total of n 6 bp cutter restriction enzymes within the 
total set of enzymes used for fragmentation, let us use subsets of r 6 bp 
cutter enzymes (taken from the total set n) to make (rxr) TRSPA test 
matrices as follows. 

20 

Version 1 -(TRSPA-1) 

For each (rxr) TRSPA test matrix, digest DNA to completion 
with all r restriction enzymes. Ligate on restriction enzyme cutting-site- 
specific adaptors with unique polymerase chain reaction amplifiable tags. 
25 Split into r 2 aliquots and for each aliquot, amplify with biotinylated restriction 
enzyme, and non-biotinylated restriction enzyme k tag-specific primers and 
array the non-biotinylated strands for all values of j and k between 1 and r. 
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Example 

Consider the following dsDNA 

_. [A ]~1 -[B]-2--[B]»3-[C]-4-[A]-5-[B]» + 
5 --[A]--l --[B3"2"[B]--3--[Ch4-[A]-5--[B]" - 

w here the restriction enzyme cutting sites are denoted A, B 
and C and the fragments after restriction digestion are denoted 1 , 2, 3, 4 
and 5. + and - denote the sense of the strands, 
o Cut to completion with A. B and C to give 
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A--1--B B--2--B B--3--C C-4-A A--5--B + 
A--1--B B--2--B B--3-C C-4--A A--5--B - 

Polymerase chain reaction amplify according to the following 
primer matrix and streptavidin capture to give 





A 


B 


C 


bio-A 




bio-A--1--B + 
A--1--B- 

bio-A--5--B + 
A--5--B - 


C--4--A + 
C--4--A-bio - 


bio-B 


A-l-B - 
A~1--B-bJO- 

A--5--B + 
A--5--B-bio - 


bio-B--2»B + 
B--2--B - 

B-2--B + 
B--2--B-bio - 


bio-B«3»C + 
B--3--C - 


bio-C 


bio-C--4--A + 
C--4--A - 


B-3--C + 
B-3-C-bio - 





Keep onfy the non-biotinylated strands to give 





A 


B 


C 


bio-A 




A-1--B- 
A--5--B - 


C-4--A + 


bio-B 


A--1--B + 
A-5--B + 


B--2-B - 
B--2--B + 


B-3--C - 


bio-C 


C--4--A - 


B--3--C + 





10 
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In simplified form 





A 


B 


C 


bio-A 




K 5- 


4+ 


bio-B 


1+. 5+ 


2-, 2-f 


3- 


bio-C 


4- 


3+ 





Repeat for all n C r combinations of restriction enzymes to 
5 generate the TRSPA. 

TRSPA-1 hybridisation patterns 

The two types of TRSPA-1 hybridisation pattern we should 
expect using a probe resulting from inter-population perfectly matched 
io duplex depletion are 

t. Hybridisation to an off-diagonal element (e.g. row x, column y 

- where x and y are different) and its complementary element reflected 
across the diagonal (i.e. row y, column x) and 

2, Hybridisation to an on-diagonal element (e.g. the element at 

15 row z, column z). 

TRSPA-1 analysis - a worked example for n=3 and r=2 

Let us take the dsDNA 

20 -C-1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
where 

A, B and C denote restriction enzyme cutting sites. 1 , 2, 3, 4, 
5, 6, 7, 8 and 9 denote the restriction fragments after digestion. 

25 

The TRSPA-1 test matrices 

There are 3 C 2 =3 TRSPA-1 test matrices - AB, BC and AC. 



WO 00/55364 



PCT/GBOO/00916 



-31 - 



The test matrix hybridisation patterns 

There are {(2-{2+1))/2>=3 possible hybridisation patterns for 
each of the test matrices 



H5 



U 



The combinatorial diversity 

If there are three possible TRSPA-1 test matrix hybridisation 
patterns and three possible test matrices, then there will be 3 3 =27 possible 
TRSPA-1 signatures. 

10 

Fragment 4 analysis - the AB TRSPA-1 matrix 

Take the dsDNA 

-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 

15 

Cut with A and B to give 

-C-1-B 
B-2-B 
20 B-3-A 
A-4-A 

A-5-C-6-C-7-B 

B-8-A 

A-9-A- 



25 



The fragment complementary to fragment 4 will be 



A-4-A 
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Polymerase chain reaction amplify and hybridise with just 
fragment 4 to give 



AB matrix j A 


B 


bio-A |9H 


bio-B 





Fragment 4 analysis - the BC TRSPA-1 matrix 

Take the dsDNA 



io 



-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with B and C to give 



-C-1-B 
B-2-B 

B-3-A-4-A-5-C 

15 C-6-C 
C-7-B 
B-8-A-9-A- 



The fragment complementary to fragment 4 will be 



B-3-A-4-A-5-C 



Polymerase chain reaction amplify and hybridise with just 
fragment 4 to give 

25 
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BC matrix 


B 


C | 


bio-B 






bio-C 







Fragment 4 analysis - the AC TRSPA-1 matrix 

Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 

Cut with A and C to give 

io -C 

C-1-B-2-B-3-A 

A-4-A 

A-5-C 

C-6-C 
15 C-7-B-8-A 

A-9-A 

A- 

The fragment complementary to fragment 4 will be 

20 

A-4-A 

Polymerase chain reaction amplify and hybridise with just 
fragment 4 to give 



AC matrix 


A 


C 


bio-A 




■ 




bio-C 
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Fragment 5 analysis - the AB TRSPA-1 matrix 

Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



Cut with A and B to give 



-C-1-B 
B-2-B 
id B-3-A 
A-4-A 

A-5-C-6-C-7-B 

B-8-A 

A-9-A- 



The fragment complementary to fragment 5 will be 



A-5-C-6-C-7-B 

Polymerase chain reaction amplify and hybridise with just 
fragment 5 to give 



Fragment 5 analysis - the BC TRSPA-1 matrix 

Take the dsDNA 

-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 




' 'I I'M ' '1 I ' 



II 
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Cut with B and C to give 

-C-1-B 
B-2-B 
5 B-3-A-4-A-5-C 
C-6-C 
C-7-B 
B-8-A-9-A- 

io The fragment complementary to fragment 5 will be 

B-3-A-4-A-5-C 

Polymerase chain reaction amplify and hybridise with just 
15 fragment 5 to give 



BC matrix 

bio-B 

bio-C 



Fragment 5 analysis - the AC TRSPA-1 matrix 

Take the dsDNA 

20 

-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with A and C to give 

25 -C 

C-1-B-2-B-3-A 

A-4-A 

A-5-C 
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C-6-C 



C-7-B-8-A 



A-9-A 



10 



15 



20 



The fragment complementary to fragment 5 will be 



Polymerase chain reaction amplify and hybridise with just 
fragment 5 to give 



Fragment 6 analysis - the AB TRSPA-1 matrix 

Take the dsDNA 

-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



Cut with A and B to give 



-C-1-B 
B-2-B 
B-3-A 
A-4-A 

A-5-C-6-C-7-B 

B-8-A 

A-9-A- 



A-5-C 
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The fragment complementary to fragment 6 will be 



A-5-C-6-C-7-B 

Polymerase chain reaction amplify and hybridise with just 
fragment 6 to give 



Fragment 6 analysis - the BC TRSPA-1 matrix 



-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



Cut with B and C to give 



-C-1-B 
B-2-B 

B-3-A-4-A-5-C 
C-6-C 
20 C-7-B 

B-8-A-9-A- 



The fragment complementary to fragment 6 wiil be 




10 



Take the dsDNA 



15 



25 



C-6-C 



Polymerase chain reaction amplify and hybridise with just 
fragment 6 to give 
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BC matrix 


B 


C 


bio-B 






bio-C 







Fragment 6 analysis - the AC TRSPA-1 matrix 

Take the dsDNA 

5 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with A and C to give 

io -C 

C-1-B-2-B-3-A 

A-4-A 

A-5-C 

C-6-C 
15 C-7-B-8-A 

A-9-A 

A- 

The fragment complementary to fragment 6 will be 

20 

C-6-C 

Polymerase chain reaction amplify and hybridise with just 
fragment 6 to give 



AC matrix 


A 


C 


bio-A 






bio-C 







25 
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Overall results 



Fragment 4 TRSPA-1 analysis 



AB 



BC 



AC 



Fragment 5 TRSPA-1 analysis 



io 



AB 




AC 



Fragment 6 TRSPA-1 analysis 



AB 



BC 



AC 




Version 2 - (TRSPA-2) 

For each (rxr) TRSPA test matrix, digest DNA to completion 
with all r restriction enzymes. Ligate on restriction enzyme cutting-site- 
specific adaptors with unique polymerase chain reaction amplifiable tags. 
Split into r 2 aliquots and for each aliquot, amplify with non-biotinylated 
restriction enzymej and non-biotinylated restriction enzyme* tag-specific 
primers and array the denatured strands for all values of j and k between 
1 and r. 



20 
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Exarnple 

Consider the following dsDNA 

--[ A]« 1 --[B]-2--[B3"3-[C]"4-[A]--5"[B]" + 
5 -[A]-1--[B]-2-[B]-3-[C]"4-[A]--5"[B]- - 

where the restriction enzyme cutting sites are denoted A, B 
and C and the fragments after restriction digestion are denoted 1, 2, 3, 4 
and 5. + and - denote the sense of the strands. 
]0 Cut to completion with A, B and C to give 

A--1--B B-2--B B--3--C C--4--A A--5-B + 
A-1-B B--2-B B--3--C C--4--A A--5--B - 



15 Polymerase chain reaction amplify according to the following 

primer matrix to give 
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A 


B 


C 


A 

A 




A „1 „g + 


Q„4__A + 






A-1--B- 


C-4--A - 






D £ D + 








B-2--B - 








M — D — D -r 








A--5-B - 




B 


A--1--B + 


D O D ! 


P. _0__D . 
D--^~D -r 




A-l-B- 


B--2-B - 


B--2--B - 




B--2--B + 




n q u. 




B--2--B - 




B-3-C - 




A--5--B + 








A--5--B - 






C 


C--4--A + 


B-2--B + 






C--4--A - 


B--2-B - 








B-3--C + 








B--3--C - 





in simplified form 





A 


B 


C 


A 




1.2.5 


4 


B 


1.2,5 


2 


2, 3 


C 


4 


2.3 





5 

Repeat for all n C, combinations of restriction enzymes to 
generate the TRSPA. 
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TSPSA-2 hybridisation patterns 

The two types of TRSPA-2 hybridisation pattern we should 
expect using a probe resulting from inter-population perfectly matched 
duplex depletion are 
5 1 . Hybridisation to an off-diagonal element (e.g. row x, column y 

- where x and y are different) and its complementary element reflected 
across the diagonal (i.e. row y, column x) and 

2. Hybridisation to a whole row and column intersecting at an 

on-diagonal element (e.g. ail of row z and all of column z), 

10 

TRSPA-2 analysis - a worked example for n=3 and r=2 

Let us take the dsDNA 



15 



-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
where 

A, B and C denote restriction enzyme cutting sites. 1,2,3, 4, 
5, 6, 7 f 8 and 9 denote the restriction fragments after digestion. 



20 The TRSPA-2 test matrices 

There are 3 C 2 =:3 TRSPA-2 test matrices - AB, BC and AC. 



The test matrix hybridisation patterns 

There are {(2-(2+1))/2}=3 possible hybridisation patterns for 
25 each of the test matrices 

W*. 

HZJ EJ H 



The combinatorial diversity 

If there are three possible TRSPA-2 test matrix hybridisation 
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pattems and three possible test matrices, then there will be 3 3 =27 possible 
TRSPA-2 signatures. 

Fragment 4 analysis - the AB TRSPA-2 matrix 



-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



Cut with A and B to give 



-C-1-B 
B-2-B 
B-3-A 
A-4-A 
15 A-5-C-6-C-7-B 
B-8-A 
A-9-A- 



The fragment complementary to fragment 4 wilt be 



Polymerase chain reaction amplify and hybridise with just 
fragment 4 to give 

25 



Take the dsDNA 



10 



20 



A^4-A 



B 



AB matrix 



A 




Fragment 4 analysis - the BC TRSPA-2 matrix 

Take the dsDNA 
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-C- 1 -B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 

Cut with B and C to give 

-C-1-B 
5 B-2-B 

B-3-A-4-A-5-C 
C-6-C 
C-7-B 
B-8-A-9-A- 

10 

The fragment comptementary to fragment 4 will be 
B-3-A-4-A-5-C 

15 Polymerase chain reaction amplify and hybridise with just 

fragment 4 to give 



BC matrix 

B 

C 



Fragment 4 analysis - the AC TRSPA-2 matrix 

20 Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with A and C to give 

25 

-c 

C-1-B-2-B-3-A 
A-4-A 
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10 



A-5-C 

C-6-C 

C-7-B-8-A 

A-9-A 

A- 



The fragment complementary to fragment 4 will be 



A-4-A 



Polymerase chain reaction amplify and hybridise with just 
fragment 4 to give 



AC matrix 



mam 



15 Fragment 5 analysis - the AB TRSPA-2 matrix 

Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



20 



Cut with A and B to give 



-C-1-B 
B-2-B 
B-3-A 
25 A-4-A 

A-5-C-6-C-7-B 

B-8-A 

A-9-A- 
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The fragment complementary to fragment 5 will be 
A-5-C-6-C-7-B 

5 Polymerase chain reaction amplify and hybridise with just 

fragment 5 to give 



AB matrix 

A 

B 



Fragment 5 analysis - the BC TRSPA-2 matrix 

io TakethedsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6^-7-B-8-A-9-A- 
Cut with B and C to give 

15 

-C-1-B 
B-2-B 

B-3-A-4-A-5-C 
C-6-C 
20 C-7-B 

B-8-A-9-A- 

The fragment complementary to fragment 5 will be 

25 B-3-A-4-A-5-C 

Polymerase chain reaction amplify and hybridise with just 
fragment 5 to give 
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BC matrix 

B 

C 



Fragment 5 analysis - the AC TRSPA-2 matrix 

Take the dsDNA 

5 

<M-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
Cut with A and C to give 

10 -C 

C-1-B-2-B-3-A 

A-4-A 

A-5-C 

C-6-C 
15 C-7-B-8-A 

A-9-A 

A- 

The fragment complementary to fragment 5 will be 

20 

A-5-C 

Polymerase chain reaction amplify and hybridise with just 
fragment 5 to give 

AC matrix 
~A 
~C 

25 
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5 



10 




M 20 



25 



Fragment 6 analysis - the AB TRSPA-2 matrix 

Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 



Cut with A and B to give 



-C-1-B 
B-2-B 
B-3-A 
A-4-A 

A-5-C-6-C-7-B 

B-8-A 

A-9-A- 



The fragment complementary to fragment 6 will be 



A-5-C-6-C-7-B 

Polymerase chain reaction amplify and hybridise with just 
fragment 6 to give 



Fragment 6 analysis - the BC TRSPA-2 matrix 

Take the dsDNA 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 
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Cut with B and C to give 

-C-1-B 

B-2-B 
5 B-3-A-4-A-5-C 
C-6-C 
C-7-B 
B-8-A-9-A- 

i o The fragment complementary to fragment 6 will be 

C-6-C 

Polymerase chain reaction amplify and hybridise with just 
15 fragment 6 to give 



BC matrix 
_ 

"C 

Fragment 6 analysis - the AC TRSPA-2 matrix 

Take the dsDNA 

20 

-C-1-B-2-B-3-A-4-A-5-C-6-C-7-B-8-A-9-A- 

Cut with A and C to give 

25 -C 

C-1-B-2-B-3-A 

A-4-A 

A-5-C 
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C-6-C 

C-7-B-8-A 

A-9-A 

A- 

5 

The fragment complementary to fragment 6 will be 

C-6-C 

10 Polymerase chain reaction amplify and hybridise with just 

fragment 6 to give 



AC matrix 

A 

C 



Overall results 

15 

Fragment 4 TRSPA-2 analysis 

AB BC AC 

Fragment 5 TRSPA-2 analysis 

20 

AB BC AC 




in pi' "i" 
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Fragment 6 TRSPA-2 analysis 



AB BC AC 




The number of test matrices 

5 For a total of n enzymes used for fragmentation and a panel 

of r enzymes per (rxr) test matrix, there wiii be n C r possible (rxr) test 
matrices, where 

n C r = (n!)/[(n-r)k!] 

n C f for various n and r is given in the following tabie 





n- 


r 


3 


4 


5 


6 


7 


8 


9 


10 


1 


3 


4 


5 


6 


7 


8 


9 


10 


2 


3 


6 


10 


15 


21 


28 


36 


45 


3 


1 


4 


10 


20 


35 


56 


84 


120 


4 




1 


5 


15 


35 


70 


126 


210 


5 






1 


6 


21 


56 


126 


252 


6 








1 


7 


28 


84 


210 


7 










1 


8 


36 


120 


8 












1 


9 


45 


9 














1 


10 


10 
















1 



The number of TRSPA spots arrayed 

15 The total number of TRSPA spots is given by 



^{ n Cr} = r 2 *{(n!)/[(n-r)!.r!]} 
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i^-fCr} for various n and r is given in the following table 





n = 


H 


3 


4 


5 


6 


7 


8 


9 


10 


1 


3 


4 


5 


6 


7 


8 


9 


10 


2 


12 


24 


40 


60 


84 


112 


144 


180 


3 


9 


36 


90 


180 


315 


504 


756 


1,080 


4 




16 


80 


240 


560 


1,120 


2.016 


3.360 


5 






25 


150 


525 


1,400 


3,150 


6,300 


6 








36 


252 


1,008 


3,024 


7.560 


7 










49 


392 


1,764 


5.880 


8 












64 


576 


2,880 


9 














81 


810 


10 
















100 



Each test as above will give rise to a particular hybridisation 

5 'signature'. 

TRSPA-1 TRSPA-2 
(for n=6 and r=3) (for n=6 and r=3) 

H I I I I nn I I II i i ■ i m i i m i nj 
I ■ I ■n I ] I J 1 1 1 1 |H| mil 1 1 m 
lDj CED Bui CIDulIDH 

LMJ ri 1 1 1 1 1 1 m ill mm 1 i ri QD 

Mi I iTm rrl »T1 |R1 I II MJ| |JJ 

QIjiJDQQDIj lED lju tJD ujj 
nri rrn 1 1 ii i ii j uu FTQ HO QD 
52 99 90 EE BmSfluiBBB 

HHfflH fBSHHIB 
ocnzjctn ce op eb Pj 

QD n I I IT1 I I I I I ■ i i I m Vgn nn 

■ t i n 1 1 rn mi in 1 1 ■ ■ 1 1 Qj| 
]Q uU QjI uXl uD luj MH lHj oO 

There will be {1+2+., .+r} = {r(r+1)/2} patterns per (rxr) test 
matrix. If there are n C r (rxr) test matrices, the total possible number of 
signatures will be given by {r(r+1)/2} raised to the power n C r . We require 
15 that the number of such different signatures is greatly in excess over the 
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number of fragments generated upon fragmentation. 

If we choose n=6 and r=3, there will be 6 possible 
hybridisation patterns per test and 20 possible tests - giving a combinatorial 
diversity of 6 20 - 3.7-1 0 15 . The preferred scheme therefore employs six 
5 enzymes, tested in groups of three - giving 180 spots per experiment In 
order to avoid restriction fragment length polymorphism problems, a 
duplicate analysis could be performed with a different set of enzymes 
sharing no cutting sites in common with the first set. 

to Example restriction enzyme sets for the preparation of test matrices 
for TRSPAs 

Criteria for enzyme choice 

In order to make TRSPAs, the selection of suitable enzymes 
15 is an important factor. Ideally, two sets of different enzymes are required to 
eliminate the small possibility that a phenotype-determining polymorphism 
might fall within a chosen restriction site and therefore compromise the 
specificity of the resulting signature. The selection of enzymes can be 
based upon a number of criteria 
20 a) The enzymes should be 6 bp cutters. 

b) Cleavage by any selected enzyme should leave a 4 bp 
overhang at the 5' end. 

c) The selected enzymes in each set should all work efficiently 
under the same buffer conditions. 

25 d) The selected enzymes in each set should ideally work 

efficiently at a single incubation temperature. 

e) The chosen enzymes should be commercially available - 
ideally at concentrations of 1 0 U / jil or more. 

f) The 5' overhangs left by any two enzymes in the same set 
30 should not be identical. 

g) No enzyme should appear in both sets for TRSPA fabrication. 
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h) Enzymes should be selected to avoid or minimise the effects 

of mammalian methytation patterns. In particular, enzymes with CG 
dinucleotides in their recognition sites should be avoided unless the 
enzyme is known to be able to restrict m5 CpG sites. 

DNA rnethylation 

In vertebrates, DNA is often methylated at the 5 th position of 
cytosine in the sequence of CpG and this is the only chemical modification 
that DNA of vertebrates contains under physiological conditions. By the 

m careful selection of enzymes which do not contain CpG sequences within 
the recognition site, or the selection of enzymes which freely restrict m5 CpG 
methylated sites, it is possible to remove the potentially adverse effects of 
DNA rnethylation from the TRSPA analysis. 6 bp cutters which are known 
to restrict m5 CpG modified DNA efficiently to leave a 5' four base overhang 

is are BspEI and XmaL These enzymes are therefore the only enzymes with 
restriction sites containing CpG dinucleotides that are potentially useful in a 
TRSPA analysis. 

Enzyme selection method 

20 Sixteen possible four base pair overhangs exist (excluding 

unusual enzymes with asymmetrical recognition sequences such as BssSI 
or Bsil), five of which contain CG in the sequence. A further four 
overhangs could potentially contain CG sequences within the restriction 
recognition site if preceded by a C and followed by a G. Enzymes are 

25 therefore preferentially selected from the remaining six groups. 

Excluding isoschizomers, there are up to four possible 
enzymes which would leave a particular 5' four bp overhang. For example, 
enzymes leaving a CTAG overhang are 



30 
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Avrll - CCTAGG 

Nhel - GCTAGC 

Spel - ACTAGT 

Xbal - TCTAGA 

Enzymes to cleave sites with all the combinations of flanking 
bases are not available for ail overhangs - hence the enzyme choice is 
more limited for some overhang groups than others. 

As a primary step towards enzyme selection, the enzymes 

10 supplied by Amersham Pharmacia Biotech, New England Bioiabs and 
Promega are ordered below by overhang sequence. Supplementary 
details such as the percentage activity in common buffers, the reaction 
temperature, concentration and supplier are also recorded. For three 
overhang sequences, there are no available enzymes, the remaining 13 

15 are described below. Enzymes considered unsuitable due to methylation 
sensitivity are shaded (darker shade). Enzymes considered unfavourable 
due to the presence of CpG sites even though they do restrict methylated 
DNA to some degree are also shaded (lighter shaded). 

20 Candidate restriction enzymes leaving a 5 l overhang of GTAC 



Enzyme 


Site 


H/M/T buffer +BSA 
% activity 


Optimu 
m°C 


\Jf\l\ 


Supplier 


inactivate 
°C/min 


Acc65l 


GGTACC 


10/50/100 


37 


10 


NEB 


65/20 


. BsiWI 


CGTACG 


100(50}/100{50)/25(-) 


55.(37}, 








BsrGI 


TGTACA 


10/100/100 


37 


10 


NEB 


80/20 


Spll 

(8siwiy± 


CGTACG 




. .., 55:---, 


& 10- v 
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Candidate restriction enzymes leaving a 5* overhang of TTAA 



Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m ; C 


U/ui 


Supplier 


inactivate 
c C/min 


AMI 


CTTAAG 


25/100/100 


37 


10 


APB 


60/15 


Candidate restriction enzymes leaving a 5 l overhang of AATT 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m C C 


U/jil 


Supplier 


inactivate 
°C/min 


EcoRl 


GAATTC 


100/100/100 


37 


>40 


APB/NEB 


65/20 


Mfel 


CAATTG 


10/50/100 


37 


10 


NEB 


65/20 


Candidate restriction enzymes leaving a 5 1 overhang of CATG 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m°C 


U/fit 


Supplier 


Inactivate 
°C/mtn 


BspHI 


TCATGA 


50/100/100 


37 


10 


NEB/APB 


65/20 


Ncoi 


CCATGG 


100/100/100 


37 


50 


NEB/APB 


65/20 


Candidate restriction enzymes leaving a 5' overhang of GATC 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m°C 


U/\i\ 


Suppiier 


inactivate 
°C/min 


BamHI 


GGATCC 


50/100/75 


37 


>40 


APB/NEB 


80/20 


Bell 


TGATCA 


100/100/75 


50 (37) 


10 


NEB 


No 


Bgtft 


AGATCT 


100/75/10 


37 


>40 


APB/NEB 


No 



WO 00/55364 



PCT/GBOO/009I6 



-57- 



Candidate restriction enzymes leaving a 5' overhang of CCGG 



Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m C C 


U/ui 


Supplier 


Inactivate 


BspE! 


TCCGGA 


100/10/0 


37 


10 


NEB 


80/20 


NgoMtV 


GCCGGC 


10/50/100 


37 


10 


NEB 




Xmai 


CCCGGG 


0/100/100 


37 


10 


NEB 


65/20 


Candidate restriction enzymes leaving a 5 l overhang of GCGC 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m 3 C 


U/ul 


Supplier 


inactivate 
c C/mtn 


Kasl 


GGCGCC 


75/100/75 


37 


5 . 


NEB, 




Candidate restriction enzymes leaving a 5 r overhang of TCGA 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m°C 


U/ul 


Supplier 


Inactivate 
°C/min 


PaeR7l 


CTCGAG 


10/100/100 . 


37 


20, 


NEB;f: v 




Sail 


GTCGAC 


50/0/0;-, 


37 


>40 


APB/NEB 




Xhol 


CTCGAG 


100/100/100 


37 


>40 


APB/NEB 





jo Candidate restriction enzymes leaving a 5 1 overhang of AGCT 



Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m°C 


U/ui 


Supplier 


inactivate 
c C/min 


Hindfif 


AAGCTT 


10/100/50 


37 


>40 


NEB/APB 


65/20 



WO 00/55364 



PCT/GBOO/00916 



-58- 

Candidate restriction enzymes leaving a 5 1 overhang of CGCG 



Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m Z C 


U/ut 


Supplier 


Inactivate 
c C/min 


BssHI! 


GCGCGC 


100/100/100 


50 


20 


NEB/APB 




Mlul 


ACGCGT 


100/75/50 


37 


10 


APB/NEB 




Candidate restriction enzymes leaving a 5' overhang of GGCC 


Enzyme 


Site 


H/MH" buffer + BSA 
% activity 


Optimu 
m "C 


U/nl 


Supplier 


Inactivate 
°C/min 


Eagl 


CGGCCG 


100/25/10 


37 


50 






■*> ^^^^^^ f 


Eco52l 
(Eagl) 


CGGCCG 


20/20/20 


37 


10, 


.. APS^ 






Candidate restriction enzymes leaving a 5' overhang of TGCA 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m C C 


U/uj 


Supplier 


Inactivate 
°C/min 


ApaLl 


GTGCAC 


10/100/100 


37 


10 


NEB/APB 


70/15 


Aiw44l 


GTGCAC 


-/50/- 


37 


10 


Promega 


70/15 


Candidate restriction enzymes leaving a 5 1 overhang of CTAG 


Enzyme 


Site 


H/M/T buffer + BSA 
% activity 


Optimu 
m°C 


Ui\x\ 


Supplier 


Inactivate 
°C/min 


Avrif 


CCTAGG 


50/100/100 


37 


4 


NEB 


No 


Nhel 


GCTAGC 


10/100/100 


37 


10 


APB/NEB 


65/20 


Spei 


ACTAGT 


25/100/75 


37 


50 


NEB 


65/20 


Xbal 


TCTAGA 


75/100/75 


37 


100 


NEB 


65/20 


Blnl(Avrfl) 


CCTAGG 


40/20/20 


37 


10 


APB 


No 
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From the above tables, a short-list of the most useful 
enzymes is given below for each of the buffer conditions shown 



Overhang 


Buffer H + BSA 


Buffer M + BSA 


Buffer T + BSA 




Enzyme 


Efficiency 


Enzyme 


Efficiency 


Enzyme 


Efficiency 


AATT 


EcoRI 


100 


EcoRI 


100 


Mfel 


100 








Mfel 


50 


- 


- 


CATG 


Ncol 


100 


Ncol 


100 


Ncol 


100 




BspHI 


50 


BspHI 


100 


BspHI 


100 


GATC 


Bglll 


100 


BamHI 


100 


BamHI 


75 




BamHI 


50 


Bglll 


75 


- 


• 


TATA 


- 


- 


- 


- 


- 


- 


ACGT 


- 


- 


- 


- 


- 


- 


CCGG 


BspEI 


100 


Xmal 


100 


Xmal 


100 


GCGC 


- 


- 


- 


- 


- 


- 


TCGA 


- 


- 


- 


- 


- 


- 


AGCT 


- 


- 


Hindllf 


100 


Hindlll 


50 


CGCG 


- 


- 


- 


- 


- 


- 


GGCC 














TGCA 






ApaLI 


100 


ApaLI 


100 


ATAT 














CTAG 


Xbal 


75 


Xbal 


100 


Nhel 


100 




Avrll 


50 


Spel 


100 


Avrll 


100 




Blnl 


40 


Nhel 


100 


Xbal 


75 








Avrll 


100 


Spel 


75 


GTAC 


Acc651 


100 


BsrGI 


100 


BsrGI 


100 








Acc65l 


75 






TTAA 






Aflll 


100 


Aflll 


100 



Taking into account ail of the criteria explained above, an 
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example selection of two six-enzyme sets is described below. Reserve 
enzymes, which could also be used, are shown as well. These reserve 
enzymes can be substituted (provided this does not cause overhang 
duplication) if practical problems regarding enzyme availability or 
5 performance should occur. 

Example enzyme set 1 - for restriction in buffer M + BSA 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 
C C 


U/ni 


Supplier 


Inactivate 
°C/min 


1 


BamHl 


GGATCC 


100 


37 


100 


NEB 


80/20 


2 


BsrGl 


TGTACA 


100 


37 


10 


NEB 


80/20 


3 


Htndiii 


AAGCTT 


100 


37 


>40 


APB 


65/20 


4 


Ncot 


CCATGG 


100 


37 


50 


NEB 


65/20 


5 


Spef 


ACTAGT 


100 


37 


50 


NEB 


65/20 


6 


Aflll 


CTTAAG 


100 


37 


10 


APB 


60/15 



io Example enzyme set 2 - for restriction in buffer M + BSA 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 
C C 


U/jd! 


Supplier 


Inactivate 
°C/min 


1 


EcoRI 


GAATTC 


100 


37 


>40 


APB 


65/20 


2 


BspHf 


TCATGA 


100 


37 


10 


APB 


65/20 


3 


Bglil 


AGATCT 


75 


37 


>40 


APB 


No 


4 


Xbal 


TCTAGA 


100 


37 


100 


NEB 


65/20 


5 


Acc65l 


GGTACC 


75 


37 


10 


NEB 


65/20 


6 


ApaL1 


GTGCAC 


100 


37 


10 


NEB 


No 
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1 . Example reserve enzymes 



Number 


Enzyme 


Site 


M+BSA 

% 
activity 


Optimum 

C C 


U/ul 


Supplier 


inactivate 
c C/min 


r1 


Xmal 


CCCGGG 


100 


37 


10 


NEB 


65/20 


r2 


BspEI 


TCCGGA 


100 


37 


10 


NEB 


80/20 


r3 


Nhel 


GCTAGC 


100 


37 


10 


APB 


65/20 


r4 


Avrli 


CCTAGG 


100 


37 


4 


NEB 


No 



Example specific (3x3) terminal restriction site profiling array (TRSPA) 
5 test matrices for above two sets of six enzymes 



Triplet combinations from BamHI, BsrGI, HindHI, Ncol, Spel and Aflil 

For example set 1 - BamHI, BsrGI, Hindlli, Ncol, Spel and 
Aflll - the 20 (= 6 C 3 ) triplet combinations are 



BamHf BsrGi HindHI 
BamHI BsrGI Ncol 
BamHI BsrGI Spel 
BamHI BsrGI Aflll 
BamHI Hindlli Ncol 



BamHI HindHI Spel 
BamHI HindHI Aflll 
BamHI Ncol Spel 
BamHI Ncol Aflll 
BamHI Spel Aflll 



BsrGI HindHI Ncol 
BsrGI HindHI Spel 
BsrGI HindHI Aflll 
BsrGI Ncol Spel 
BsrGI Ncol Aflll 



BsrGI Spel Aflll 
Hindlli Ncol Spel 
HindHI Ncol Aflll 
Hindlli Spel Aflll 
Ncol Spel Aflll 



15 



Triplet combinations from EcoRI, BspHl, Bglll, Xbal, Acc65l and ApaU 

For example set 2 - EcoRI, BspHl, Bglll, Xbal, Acc65l and 
ApaLi- the 20 (= 6 C 3 ) triplet combinations are 



EcoRI BspHl Bglll 
EcoRI BspHl Xbal 
EcoRI BspHl Acc65I 
EcoRI BspHl ApaU 
EcoRI Bglll Xbal 



EcoRI Bgifl Acc65I 
EcoRI Bglll ApaU 
EcoRI Xbal Acc65I 
EcoRI Xbal ApaU 
EcoRI Acc65l ApaU 



BspHl Bglll Xbal 
BspHl Bglll Acc65l 
BspHl Bglll ApaU 
BspHl Xbal Acc65l 
BspHl Xbal ApaU 



BspHl Acc65i ApaU 
Bglll Xbal AccSBl 
Bglll Xbal ApaU 
Bglll Acc65l ApaU 
Xbal AccSSl ApaU 
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Example TRSPA-1 test matrices for set 1 - BamHI, BsrGI, Hindlll, Ncol, 
Spel and AMI 



matrix 1 


5"-HO-SamH/-pnmer 


S'-HQ'BsrGI-pnmer 


5'-HO-H//7d//Aprtmer 


5'-biotirvSamH/-pnmer 


1 


2 


3 


5'-biotin-SsfG/-pnmer 


4 


5 


6 


5'-biotin-H/nrf///-prumer 


7 


8 


9 



matrix 2 


S'-HO-SamH/'primer 


5 , -HO-0srG/-pnmer 


S'-HO-NcoZ-primer 


5'-biotin-8amH/-pnmer 


1 


2 


3 


5'-biotin-£srt3/-primer 


4 


5 


6 


S'-biotin-NcoAprimer 


7 


8 


9 



5 



matrix 3 


5'-HO-Ban?H/-pnmer 


S'-HO-BsrGi-primer 


S'-HO-SpeZ-primer 


5'-biOtin-Ba/77H/-primer 


1 


2 


3 


5'-biotin-BsrG/-pnmer 


4 


5 


6 


S'-biotin-SpeZ-primer 


7 


8 


9 



matrix 4 


5'- HO* BamHt-p rimer 


5'-HO-Bsrt3/-primer 


5'-HO->4////-primer 


5'-biotin-BamH/-primer 


1 


2 


3 


5 1 -biotin-Ss/'G/-primer 


4 


5 


6 


5'-biotin-Atf//-prrmer 


7 


8 


9 



matrix 5 


5'-HO-BamH/-pnmer 


5'-HO-H/nd///-primer 


S-HO-Ncol-primer 


5'-biotin-BamH/-pnmer 


1 


2 


3 


5 T -biotin-H/ncr///-primer 


4 


5 


6 


5'-biotin-/Vco/*pnmer 


7 


8 


9 



matrix 6 


5'-HO-SamH/-primer 


5'-HOH/ncW//-pnmer 


5'-HO-Spe/-primer 


5*-biotin-SamH/-pnmer 


1 


2 


3 


5'-biotin-tf/nd///-pnmer 


4 


5 


6 


S'-biotin-Spe/'primer 


7 


8 


9 



matrix 7 


5'-HO-BamH/-prtmer 


5 , -HO-H//?cf///-pnmer 


S'-HO-iAfl/y-pnmer 


S'-biotin-BarnHZ-primer 


1 


2 


3 


5'-biotin-H/nctf//-pnrner 


4 


5 


6 


5'-biotinv4f///-pnmer 


7 


8 


9 



10 



WO 00/55364 



PCT/GBOO/00916 



-63- 



matrix 8 


5 v HO-8amH/-primer 


5-HO-/Vco/-pnmer 


5'-HOSpe/-pnmer 


S'-biotin-SamH/'prtmer 


1 


2 


3 


5'-biotin-/Vco/-pnmer 


4 


5 


6 


5"-biotin-Spe/-primer 


7 


8 


9 



matrix 9 


5"-HO-BafnH/-primer 


S'-HO-A/coAprimer 


5'-HO-/4tf//-primer 


5'-biotin-SamH/-primer 


1 


2 


3 


5'-biotin-/Vco/-prtmer 


4 


5 


6 


5'-biotin-Atf//-primer 


7 


8 


9 



matrix 10 


5*- HO- BamHI- primer 


5'-HOSpe/-prtmer 


S'-HO-yW/Aprimer 


5'-biotin-BamH/-pnmer 


1 


2 


3 


5'-biotin-Spe/-pnmer 


4 


5 


6 


5'-biotin-/A////-pnmer 


7 


8 


9 



matrix 1 1 


5'-HO-SsrG/-primer 


5'-HO-H/ncWApnmer 


5*-HO-/VcoApnmer 


S'-biotm-BsrGAprimer 


1 


2 


3 


5'-biotin-H/nd///-primer 


4 


5 


6 


5'-biotin-/VcoApnmer 


7 


8 


9 



matrix 12 


S'-HO-BsrGApnmer 


S'-HO-H/nctf/Aprimer 


S'-HO-SpeApnmer 


5'-bioti n-BsrGAprirner 


1 


2 


3 


5'-biotin- H/nd//Apnmer 


4 


5 


6 


5'-biotin-SpeApnmer 


7 


8 


9 



5 



matrix 13 


S'-HO-BsrGAprimer 


5'-HO-H/nGM/Apnmer 


S'-HO-yW/Aprimer 


S'-biotm-BsrGAprimer 


1 


2 


3 


5'-biotin-A//nd//Aprimer 


4 


5 


6 


5'-biotin-/tf//Apnmer 


7 


8 


9 



matrix 14 


5'- HO- BsrG Aprimer 


5*-HO-AycoAprimer 


5'-HO-SpeAprimer 


S'-biotin-BsrGApnmer 


1 


2 


3 


5'-biotin-/VcoApnmer 


4 


5 


6 


S'-biotin-SpeAprimer 


7 


8 


9 



matrix 1 5 


5 y -HQ-BsrG /-primer 


5'-HO-NcoAprimer 


5'-HO-/Afl//-pnmer 


5'-biotin-SsrGApnmer 


1 


2 


3 


S'-biotin-NcoAprimer 


4 


5 


6 


5'-biotin-/4f///-prtmer 


7 


8 


9 
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matrix 16 


5*-HO-BsrG/-pnmer 


5'-HO-Spe/-pnmer 


5'-HO-4////-pnmer 


5'-biottn-BsrG/-pnmer 


1 


2 


3 


5'-biotin-Spe/-primer 


4 


5 


6 


5 , -biotin-i4////-primer 


7 


8 


9 



matrix 17 


5*-HO-H/nd///-pnmer 


5"-HO-A/co/-pnmer 


5'-HO-Spe/-pnmer 


5'-biotin-H/nd///-pnmer 


1 


2 


3 


5'-biotin-A/co/-pnmer 


4 


5 


6 


5'-biotin-Spe/-pnmer 


7 


8 


9 



matrix 18 


5'-HO-H/nc////-primer 


5'-HO-A/co/-pnmer 


5'-HCMtf//-primer 


5'-b*otin-H//7d///-primer 


1 


2 


3 


S'-biotirv/VcoZ-prirner 


4 


5 


6 


5'-b)Ot]n-^///y-pnmer 


7 


8 


9 



matrix 19 


S'-HO-H/nof/rt-primer 


5'-HO-Spe/-primer 


5'-HCMtf//-pnmer 


5'-biotin-H/nd///-primer 


1 


2 


3 


5'-biotin-Spe/-primer 


4 


5 


6 


5*-biotin->4//i/-primer 


7 


8 


9 



matrix 20 


5'-HO-/Vco/-pnmer 


5'-HO-Spe/-primer 


5'-HCMf///-primer 


S'-biotin-A/co/'primer 


1 


2 


3 


S'-biotin-SpeZ-primer 


4 


5 


6 


5'-biotin-/W/-pnmer 


7 


8 


9 



5 



Example TRSPA-1 test matrices for set 2 - EcoRI, BspHI, Bgltt, Xbal, 
Acc65l and ApaLI 



matrix 21 


5'-HO-£cofl/-primer 


5'-HO-SspH/-primer 


S'-HO-SyZ/Aprimer 


S'-biotin-Ecoffl-prmner 


1 


2 


3 


S'-biotin-BspHZ-primer 


4 


5 


6 


5 , -biotm-Bgf///-pnmer 


7 


8 


9 



matrix 22 


5'- HO- Ecotf /-primer 


5'-HO-BspH/-primer 


5'-HO-Xba/-pnmer 


5'-biotin-Ecofl/-prirrier 


1 


2 


3 


5'-biotin-SspH/-prtmer 


4 


5 


6 


5'-biotin-X£>a/-pnmer 


7 


8 


9 
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5 



matrix 23 


5"-HO-EcoR/"pnmer 


5'-HO-BspH/-primer 


5'-HCMcc65/-pnmer 


5'-biotin-£cotf/-primer 


1 


2 


3 


5'-biotin-0spH/-pnmer 


4 


5 


6 


5'-biotin-/4cc65/-pnmer 


7 


8 


9 




matrix 24 


5'-HO-£cofl/-primer 


5'-HO-SspH/-pnmer 


5"-HO-ApaL/-pnrner 


B'-biotin-EcoflAprirner 


1 


2 


3 


S'-biottn-SspHAprimer 


4 


5 


6 


5*-biotin-ApaL/-primer 


7 


8 


9 




matrix 25 


5*-HO-£coP/-pnmer 


5'-HO-8p///-primer 


5'-HO-X£>a/-pnmer 


5'-biotin-£coft/-primer 


1 


2 


3 


5*-biotin-Sg///-primer 


4 


5 


6 


5'-biotin-X6a/-pnmer 


7 


8 


9 




matrix 26 


5'-HO-£cofi/-primer 


5'-HO-Bp///-primer 5'-HO-Acc65i~pr\mer 


5'-btotin-£cofl/-prtrner 


1 


2 


3 


5'-biotin-09///-pnmer 


4 


5 


6 


5*-biotin->Acc65/-pnmer 


7 


8 


9 






matrix 27 


S'-HO-EcoflZ-primer 


S'-HO-Sg/ZZ-primer 


5'-HCMpaL/-primer 


S'-biotin-Eco^-primer 


1 


2 


3 


5'-biotm-Bp///-primer 


4 


5 


6 


S'-biotm-zApa/. /-primer 


7 


8 


9 




matrix 28 


S'-HO-EcoR /-primer 


S'-HO-Xba/'primer 


5'-HO-/4cc65/-pnmer 


S'-biotm-Ecotf /-primer 


1 


2 


3 


5'-biotm-Xba/-primer 


4 


5 


6 


S'-biotin-AccSS /-primer 


7 


8 


9 


matrix 29 


5*-HO-£coR/-pnmer 


5'-HO-Xfca/-primer 


5'-HCMpa/-/-primer 


5'-biotin-£cof?/-pnmer 


1 


2 


3 


5'-biotin-X£>a/-primer 


4 


5 


6 


5' -biotin-yApaL /-primer 


7 


8 


9 




matrix 30 


5'-HO-£cofi/-pnmer 


5'-HO-Acc65/-primer 


5'-HCMpaL/-pnmer 


5' -biotin-Ecotf /-primer 


1 


2 


3 


5'-biotin-,4cc65/*pnmer 


4 


5 


6 


5 , -biotin->ApaL/-primer 


7 


8 


9 
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matrix 31 


5'-HO-SspH/-primer 


5'-HO-6g///-primer 


S'-HO-XfcaApnmer 


5'-biottn-BspH/-p rimer 


1 


2 


3 


5-to!Otin-8g///-pnmer 


4 


5 


_6 


5'-biotin-Xjba/-pnmer 


7 


8 


9 



matrix 32 


5"-HO-BspH/-primer 


5'-HO-£p///-pnrner 


5'-HO-^ccff5/-pnmer 


5'-b*otirvBspH/-pnmer 


1 


2 


3 


5'-biotin-Bgr//Apnmer 


4 


5 


6 


5"-biotin~^cc65/-primer 


7 


8 


9 



matrix 33 


5'-HO-BspH/-pnmer 


o -HO-6(?///-primer 


5'-HCMpaL/-pnmer 


5*-bioun-BspH/-pnmer 


1 


2 


3 


5'-bionn-Bg///-primer 


4 


5 


6 


5'-b!Otin-^paL/-pnmer 


7 


8 


9 



matrix 34 


S'-HO-BspHZ-primer 


S'-HO-Xba/'primer 


5'-HO-4cc65/-pnmer 


5'-biotin-BspH/-pnmer 


1 


2 


3 


S'-biottn-XbaJ-pnmer 


4 


5 


6 


5'-biot!n-4cc65/-pnmer 


7 


8 


9 



matrix 35 


S'-HO-BspHZ-primer 


S'-HO-XfcaZ-primer 


5'-HO-/4paL/-pnmer 


5*-biotin-Bsptf/-pnmer 


1 


2 


3 


5'-biotirvXba/-primer 


4 


5 


6 


S'-biotirMpa/. /-primer 


7 


8 


9 



5 



matrix 36 


5'-HO-8spH/-primer 


5*-HCMcc65/-pnrner 


5'-HO-4pai/-pnmer 


5'-biotin-BspH/-primer 


1 


2 


3 


5'-biotm-Accff5Apnmer 


4 


5 


6 


5'-biotin-dpaLApnmer 


7 


8 


9 



matrix 37 


5'-HOBgr///-pnmer 


5 , -HO-Xjba/-pnmer 


5 , -HO-Acc65/-primer 


5'-biotin-Sp///-pnmer 


1 


2 


3 


5'-biotin-Xba/-pnmer 


4 


5 


6 


5 , -btotin-Acc65/-pnmer 


7 


8 


9 



matrix 38 


B'-HO-SpW-prtmer 


S'-HO-X^aApnmer 


5'-HCMpa/-/-primer 


5'-biotin-B<?///-primer 


1 


2 


3 


5'-biotin-Xba/-primer 


4 


5 


6 


5'-biotjn-/\pat/-pnmer 


7 


8 


9 
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matrix 39 


5'-HO-Bg///-prtmer 


5'-HCMcc65/-pnmer 


5'-HO-ApaL /-primer 


5'-biotin-flg///-pnmer 


1 


2 


3 


5'-biotin->4cc65/-primer 


4 


5 


6 


5'-biotin- ApaL /-primer 


7 


8 


9 



matrix 40 


$'-HQ-Xba /-primer 


S'-HO-AccSSZ-primer 


5'-HO-/4pa/>pnmer 


5*-biotin-X£?a/-pnmer 


1 


2 


3 


5*-bioti n-Accf55/- primer 


4 


5 


6 


5'-biotin-y4paU-pnmer 


7 


8 


9 



Example TRSPA-2 test matrices for set 1 - BamHI, BsrGI, Hindlll, Ncol, 
Spel and Aflll 

5 



matrix 1 


5'-HO- BamH/-pnmer 


5'-HO-esrG/-pnmer 


5'-HO-H/ntf///-primer 


5'-HOSamH/-primer 


1 


2 


3 


5'-HO-BsrG/-prmner 


4 


5 


6 


5'-HO-H//?d///-prirrier 


7 


8 


9 



matrix 2 


5'-HO-8amtf /-primer 


5'-HO- BsrG/-p rimer 


5'-HO-/Vco/-pnmer 


5'-HO-BamH/-pnmer 


1 


2 


3 


5'-HO-SsrG/-pnmer 


4 


5 


6 


5'-HO-A/co/-pnmer 


7 


8 


9 



matrix 3 


S'-HO-earnHZ-primer 


5'-HO-SsrG/-pnmef 


5'-HO-Spe/-pnmer 


5'-HO-BamH/-pnmer 


1 


2 


3 


5'-HO-BsrG/-pnmer 


4 


5 


6 


S'-HO-Spe/'primer 


7 


8 


9 



matrix 4 


S'-HO-BamHAprimer 


5'-HO-8srG/-primer 


5'-HO->WWApnmer 


5'- HO- Ba/nHA primer 


1 


2 


3 


5'-HO-0srG/-primer 


4 


5 


6 


S'-HO/tf/tf-primer 


7 


8 


9 



matrix 5 


5'-HO- Ban?H/-prt mer 


5'-HO-H/nd///-primer 


5'-HO-Afco/-pnmer 


5'-HO-BamH/-primer 


1 


2 


3 


5'-HO-H/hcf//Apnmer 


4 


5 


6 


5'-HO-Nco/-pnmer 


7 


8 


9 
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matrix 6 


S'-HO-BamHI-pnmer 


5'-HO-H/nc////-primer 


5"-HO-Spe/-pnmer 


5'-HO-SamH/-pnmer 


1 


2 


3 


5'-HO-H/nctf/f-pnmer 


4 


5 


6 


5'-HO-Spe/-primer 


7 


8 


9 



matrix 7 


5'-HO-SamH/-pnrner 


5'-HO-H/nd///-pnmer 


5'-HO-/tf///-pnmer 


5'-HO-SarnHi-prirner 


1 


2 


3 


5'-HO-H/nd///-pnmer 


4 


5 


6 


5 T -HCM/W-pnmer 


7 


8 


9 



matrix 8 


5*-HO-BamH/-pnmer 


5'-HO-/Vco/-pnmer 


5'-HO-Spe/-primer 


5*-HO-SamH/-primer 


1 


2 


3 


5'-HO-Nco/-pnmer 


4 


5 


6 


5'-HO-Spe/-primer 


7 


8 


9 



matrix 9 


5'-HO-Bamtf/-primer 


5'-HO-/Vco/-pnmer 


S'-HCMfl/f-pnrner 


S'-HO-BarnHZ-primer 


1 


2 


3 


5'-HO-Nco/-primer 


4 


5 


6 


5'-HO-i4fl//-primer 


7 


8 


9 



matrix 10 


S'-HO-SarnHZ-primer 


5'-HO-Spe/-pnmer 


5*-HO-/4////-pnmer 


5'~HO-Bamrt/-pnmer 


1 


2 


3 


B'-HO-SpeAprimer 


4 


5 


6 


5 ! -HO-/4r7//-pnmer 


7 


8 


9 



5 



matrix 1 1 


5'-HO- Bsr GZ-primer 


5"-HO-H//7Ctf//-pnmer 


B'-HO-ZVco/'primer 


S'-HO-SsfGAprimer 


1 


2 


3 


5'-HO-H/nc(W/-primer 


4 


5 


6 


5'-HO-/Vco/-primer 


7 


8 


9 



matrix 12 


5 , -HO-SsrG/-pnmer 


5*-HO-tf/nd///-pnmer 


5'-HO-Spe/-pnmer 


5'-HO-Ss/'G/-primer 


1 


2 


3 


S'-HO-H/ndZ/y-primer 


4 


5 


6 


5'-HO-Spe/-primer 


7 


8 


9 



matrix 1 3 


5 , -HO-Ss/'G/-pnmer 


5*-HO-H/nd//Aprimer 


5'-HO-/tf///-primer 


5'-HO-BsrG/-primer 


1 


2 


3 


5'-HO-H/ncW/-pnmer 


4 


5 


6 


S'-HO-AfflJ-primer 


7 


8 


9 
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matrix 14 


5'-HO-BsrG/-pnmer 


s -HO-A/co/-pnmer 


5"-HO-Spe/-pnmer 


5*-HO-Bsrt3Aprimer 


1 


2 


3 


5'-HO-A/co/-pnmer 


4 


5 


6 


5'-HO-Spe/-pnmer 


7 


8 


9 



matrix 15 


5'-HO-BsfG/-pnmer 


5'-HO-Afco/-prtmer 


5'-HO-Afl//-primer 


5'-HOBsrG/-pnmer 


1 


2 


3 


5"-HO-A/co/-pnmer 


4 


5 


6 


5'-HO-Atf//-pnmer 


7 


8 


9 



matrix 16 


5'-HO-Bs/"G/-primer 


5'-HO-Spe/-pnmer 


S'-HO-AfflAprimer 


5'-HO-BsrG/-pnmer 


1 


2 


3 


5'-HOSpe/-pnmer 


4 


^ 5 


6 


5-HO-A////-pnmer 


7 


8 


9 



matrix 17 


S'-HO-HjrtcWAprimer 


5'-HO-/Vco/-primer 


5'-HO-SpeApnmer 


5'-HO-tf/nd///-primer 


1 


2 


3 


S'-HO-AfcoAprimer 


4 


5 


6 


S'-HO-SpeApnmer 


7 


8 


9 



matrix 18 


5"-HO-H/nof//Apnmer 


5'-HO-/VcoApnmer 


5'-HOAfl/Aprimer 


5'-HO-H/ntf//Aprimer 


1 


2 


3 


5"-HO-A/coApnmer 


4 


5 


6 


5*-HO-Atf/Apnmer 


7 


8 


9 



5 



matrix 19 


5'-HO-H/ncf///-primer 


S'-HO-SpeApnmer 


S'-HO-Afl/Aprimer 


S'-HO-Hwrt/Aprimer 


1 


2 


3 


5'-HO-SpeAprimer 


4 


5 


6 


5'-HO-A///Aprimer 


7 


8 


9 



matrix 20 


5'-HO-A/coApnmer 


5'-HO-Spe/-primer 


5'-HO-A///Aprimer 


5'-HO-A/coAprimer 


1 


2 


3 


S'-HO-SpeAprimer 


4 


5 


6 


5'-HO-A///Apnmer 


7 


8 


9 
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Example TRSPA-2 test matrices for set 2 - EcoRl BspHI, BgllL Xbai, 
Acc65l and ApaLl 



matrix 21 


5"-HO-Ecof?/-pnmer 


5"-HO-BspH/-pnmer 


5*-HO-Bg///-primer 


5*-HO-EcoH/-pnmer 


1 


2 


3 


5"-HO-6spH/-primer 


4 


5 


6 


5'-HO-So/J/-pnmer 


7 


8 


9 



matrix 22 


5*-HO-EcoR/-pnmer 


B'-HO-SspHZ-primer 


5'-HO-X6a/-primer 


5'-HO-EcoFr/-primer 


1 


2 


3 


5'-HO-BspH/-primer 


4 


5 


6 


5"-HO-Xba/-pnmer 


7 


8 


9 



5 



matrix 23 


o'-HO-EcoflZ-prtmer 


5'-HO- BspH/-primer 


5'-HCMcc65Apnmer 


S'-HO-Ecoflf-pnmer 


1 


2 


3 


5*-HO-SspH/-primer 


4 


5 


6 


5 , -HO-/4cc65/-pnmer 


7 


8 


9 



matrix 24 


5'-HO-Eco/?/-pnmer 


5'-HO-BspH/-pnmer 


5'-HCMpaL/-primer 


S'-HO-Ecoff /-primer 


1 


2 


3 


5'-HO-SspH/-pnmer 


4 


5 


6 


5'-HO-/*paL/-pnmer 


7 


8 


9 



matrix 25 


5'-HO£cotf /-primer 


5'-HO-Bp///-pnmer 


5'-HO-X6a/-pnmer 


5'- HO- EcoRh primer 


1 


2 


3 


S'-HO-Bg/ZAprimer 


4 


5 


6 


5'-HOX£ia/-primer 


7 


8 


9 



matrix 26 


5*-HO-Eco/?/-pnmer 


5'-HO-B0W/-pnmer 


5'-HO-4cc65/-primer 


5'-HO-Ecofl/-pnmer 


1 


2 


3 


5'-HO-Bg///-primer 


4 


5 


6 


S'-HO-^cces/'primer 


7 


8 


9 



matrix 27 


5 T -HO-Ecof?/-pnmer 


S'-HO-Bp/tf-primer 


5'-HO->ApaL/-primer 


5'-HO- Eco W-primer 


1 


2 


3 


5 , -HO-Bg///-pnrner 


4 


5 


6 


5*-HO-/4paL/-pnmer 


7 


8 


9 



WO 00/55364 



PCT/GB00/00916 



-71 - 



matrix 28 


5"-HO-Ecof?/-primer 


5'-HO-Xba/-primer 


5'- HO-Acc65i- primer 


5'- HO-Ecofl/- primer 


1 


2 


3 


5'-HO-Xba /-primer 


4 


5 


6 


5'-HO-/4cc65/-pnmer 


7 


8 


9 



matrix 29 


S'-HO-Ecotf /-primer 


5'-HO-Xba/-pnmer 


5'-HO-/4paL/-p rimer 


5'-HO-EcoH/-pnmer 


1 


2 


3 


5'-HO-XPa/-pnmer 


4 


5 


6 


5*-HO-/4paL/-pnmer 


7 


8 


9 



matrix 30 


5'-HO-Eco#/-pnmer 


5 *-HCMcct>5 /-primer 


5'-HO-v4paL/-pnmer 


S'-BO-Ecotf/'primer 


1 


2 


3 


5'- HO-4cc65/- primer 


4 


5 


6 


5'-HO-/4paL/-primer 


7 


8 


9 



matrix 31 


5'- HO- SspH/-primer 


5"-HO-Sp///-primer 


5'-HO-X£?a/-pnmer 


5'-HO-BspH/-pnmer 


1 


2 


3 


5'-HO-Bp///-pnmer 


4 


5 


6 


5'-HO-X£>a/-pnmer 


7 


8 


9 



matrix 32 


5'-HO-BspH/-primer 


5'-HO-Bp///-primer 


5'-HCMcc65/-pnmer 


5' -HO-Bsptf /-primer 


1 


2 


3 


5'-HO-Sgf///-pnmer 


4 


5 


6 


5'-HO->4cc65/-primer 


7 


8 


9 



5 



matrix 33 


5 f - H 0- BspHi-p n me r 


5'-HO-B0///-primer 


S'-HO-^paLApnmer 


5'-HO-BspH/-primer 


1 


2 


3 


5'-HO-Bg///-prtmer 


4 


5 


6 


5'- HO-Apail- primer 


7 


8 


9 



matrix 34 


5 '-HO- BspH/-pnmer 


5'-HO-XjE?a/-primer 


5'- HO-.Acc65/-primer 


5'-HO-BspH/-primer 


1 


2 


3 


S'-HO-XbaZ-primer 


4 


5 


6 


5'-HO-/4cc65/-pnmer 


7 


8 


9 



matrix 35 


5'-HO-BspH/-primer 


5*-HO-X6a/-pnmer 


5'-HO-/4paL/-primer 


5'-HO-BspH/-primer 


1 


2 


3 


5'-HO-X6a/-primer 


4 


5 


6 


S'-HO-zApaLZ-primer 


7 


8 


9 
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matrix 36 


5'-HO-BspH/-pnmer 


5*-HCMcc65/-pnmer 


o'-HO-Apa/J-primer 


5*-HO- BspHI-pnmer 


1 


2 


3 


5 '-HO-Acc65i- primer 


4 


5 


6 


5'-HO-/4paL/-pnmer 


7 


8 


9 



matrix 37 


5*-HO-8p///-primer 


5'-HO-Xda/-pnmer 


5*-HO-/4cc65/-pnmer 


5'-HO-Bflrf//-primer 


1 


2 


3 


5'-HO-X6a/-pnmer 


4 


5 


6 


5'-HCMcc65/-prnmer 


7 


8 


9 



matrix 38 


5"-HO-Bp///-pnmer 


5'-HO-Xfc>a/-primer 


5'-HO-v4pa/./-pnmer 


5"-HO-Bg///-primer 


1 


2 


3 


5'-HO-X6a/-pnmer 


4 


5 


6 


5'-HO-/4paL/-pnmer 


7 


8 


9 



matrix 39 


5 T -HO-Bg///-pnmer 


5 0-y4cc65/-pn mer 


5'-HO-/4paL/-p rimer 


5'-HO-Sg///-primer 


1 


2 


3 


5'-HO->*cc65/-prirTier 


4 


5 


6 


5'-HO-/4paL/-primer 


7 


8 


9 



matrix 40 


5'-HO-Xba/-pnrner 


5'-HO-/4cc65/-primer 


5'-HO-4paL/-pnmer 


5 r - HO- X6a /-primer 


1 


2 


3 


5*-HO->4cc65/-primer 


4 


5 


6 


5'-HO-^pal/-pnmer 


7 


8 


9 



5 



Hybridisation patterns 

There are six possible hybridisation patterns for a given probe 
fragment from inter-popuiation perfectly matched duplex depletion 
enrichment with a (3x3) test matrix in TRSPA-1 analysis, 
io There are also six possible hybridisation patterns for a given 

probe fragment from inter-population perfectly matched duplex depletion 
enrichment with a (3x3) test matrix in TRSPA-2 analysis. 

We can denote these patterns as follows 



15 
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TRSPA-1 analysis 



Pattern a (1 only) 



matrix # 


5'-HO-X-primer 


S'-HO-Y-pnmer 


5'-HO-Z-prirner 


S'-biotin-X-pnmer 




2 


3 


5*-biotm-Y-pnmer 


4 


5 


6 


5'-btotin-Z-primer 


7 


8 


9 


Pattern b (2 and 4) 


matrix # 


S'-HO-X-pnmer 


5'-HO-Y-pnmer 


o'-HO-Z-pnmer 


5*-biotin-X-pnmer 


1 




3 


5 r -biotm-Y-primer 




5 


6 


5'-biotin-Z-pnmer 


7 


8 


9 



Pattern c {3 and 7) 

30 



matrix # 


5'-HO-X-pnmer 


S'-HO-Y-primer 


5'-HO-Z-primer 


5 f ~biotin-X-primer 


1 


2 




S'-biotin-Y-primer 


4 


5 


6 


S'-biotin-Z-pnmer 




8 


9 


Pattern d (5 only) 


matrix # 


S'-HO-X-pnmer 


5'-HO-Y-pnmer 


5'-HO-Z-primer 


S'-biotin-X-pnmer 


1 


2 


3 


S'-biotin-Y-prtmer 


4 


. A f ;. . ■'■ . ■ := i- ■' .' , ' ''• .. 


6 


S'-biotin-Z-pnmer 


7 


8 


9 
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Pattern e (6 and 8) 



5 



matrix # 


5*-HO-X-pnmer 


5'-HO-Y-pnmer 


S'-HO-Z-pnmer 


5*-biotin-X-pnmer 


1 


2 


3 


S'-btotin-Y-pnmer 


4 


5 




5'-biotin-Z-pnmer 


7 






9 | 


Pattern f (9 only) 


matnx # 


5'-HO-X-pnmer 


S'-HO-Y-pnmer 


S'-HO-Z-primer 


5' -biotin-X- primer 


1 


2 


3 


5'-biotin-Y-pnrner 


4 


5 


6 


5'-biotin-Z-primer 


7 


8 





TRSPA-2 analysis 



Pattern a {row 1 and column 1) 



matrix # 
5*-HO-X-prtmer 
S'-HO-Y-pnmer 
5*-HO*Z-pnmer 




Pattern b {2 and 4) 



matrix # 


5-HO-X-primer 


5'-HO-Y*primer 


S'-HO-Z-pnmer 


5'- HO -X-primer 


1 




3 


5'- HO -Y-primer 




5 


6 


5'- HO -Z-pnmer 


7 


8 


9 
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matrix 4 


5-HO-X-pnmer 


5'-HO-Y-pnmer 


5'-HO-Z-pnrner 


5*- HO -X-primer 


1 


2 




5'- HO -Y-primer 


4 


5 


6 


5'- HO -Z-pnmer 




8 


9 



Pattern d (row 2 and column 2) 



matrix # 


5'-HO-X-prirner S'-HO-Y-primer B'-HO-Z-primer 


5'- HO -X-pnmer 






5'- HO -Y-pnmer 




5'- HO -Z-pnmer 




7 ® | 



Pattern e (6 and 8) 



matrix # 


5'-HO-X-pnmer 


S'-HO-Y-primer 


5'-HO-Z-primer 


5'- HO -X-pnmer 


1 


2 


3 


5'- HO -Y-primer 


4 


5 




5 1 - HO -Z-pnmer 


7 






Pattern f (row 3 and column 3) 






matrix # 


5'-HO-X-primer 


S'-HO-Y-prtmer 


5'-HO-Z-pnmer 


5'- HO -X-prtmer 


1 


2 




5'- HO -Y-primer 


4 


5 





]0 



5'- HO -Z-pnmer 



15 



Signatures 

If we have 40 such (3x3) test matrices and we denote the 
possible hybridisation patterns a t b, c, d, e and f, then we can write the 
overall TRSPA hybridisation signature as follows 
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Matrix 


Pattern 


matrix 


pattern 


1 


a. b t c. d. e or f 


21 


a, b, c, d. e or f 


2 


a. b, c, d. e or f 


22 


a, b, c. d, e orf 


3 


a. b, c, d, e or f 


23 


a, b y c, d, e or f 


4 


a. b, c, d. e or f 


24 


a. b, c, d, e or f 


5 


a, b, c, d, e or f 


25 


a, b, c, d, e or f 


6 


a, b, c, d, e or f 


26 


a, b, c, d, e or f 


7 


a. b, c, d. e or f 


27 


a, b, c, d, e or f 


8 


a, b, c, d, e or f 


28 


a, b, c, d, e or f 


9 


a, b, c, d, e or f 


29 


a, b, c. d, e or f 


10 


a, b, c, d T e or f 


30 


a, b T c, d. e or f 


11 


a, b, c. d, e or f 


31 


a, b. c, d, e or f 


12 


a, b, c, d, e orf 


32 


a, b, c, d, e or f 


13 


a, b, c, d, e or f 


33 


a, b, c, d, e or f 


14 


a, b, c, d, e or f 


34 


a, b t c, d, e or 1 


15 


a, b, c, d, e or f 


35 


a, b, c, d, e or f 


16 


a, b, c, d, e or f 


36 


a, b, c, d, e or f 


17 


a, b, c, d T e or f 


37 


a f b, c t d, e or f 


18 


a, b, c, d, e or f 


38 


a, b, c, d, e orf 


19 


a, b, c, d, e or f 


39 


a t b, c, d, e orf 


20 


a, b. c P d, e or f 


40 


a, b, c, d, e or f 



Aspect (III) 

in another aspect the invention provides a nucleic acid 
5 characterisation method which comprises presenting to the set of arrays as 
defined above a nucleic acid fragment of interest under hybridisation 
conditions, and observing a pattern of hybridisation. Preferably, a plurality 
of nucleic acid fragments of interest are separately presented to the set of 
arrays, and the resulting patterns of hybridisation are compared. 
io Preferably, the plurality of nucleic acid fragments of interest are drawn from 
the mixture of DNA fragments, enriched in fragments that are characteristic 
of a phenotype of interest, as described under the invention (1) above. 

Thus in this aspect the invention provides a method of 
identifying fragments of DNA that are characteristic of a phenotype of 
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interest, which method comprises recovering, cloning and amplifying 
individual DNA fragments from the mixture of DNA fragments obtained 
under invention (1) above, presenting the individual DNA fragments to the 
set of arrays as defined under hybridisation conditions, observing a pattern 
5 of hybridisation generated by each individual DNA fragment, and subjecting 
to further investigation any two or more individual DNA fragments whose 
hybridisation patterns are similar or identical. 

TRSPA signatures and whole genome association studies 

io After a given number of cycles of inter-population perfectly 

matched duplex depletion, phenotype determining fragments will be 
enriched but will not be entirely free from 'noise' fragments. Noise may 
result from unequal allelic frequencies for certain SNPs between the two 
populations. Noise will also result from the presence of somatic mutations 

15 in the cells used to prepare DNA fragments and from the use of 

polymerase chain reaction in some of the embodiments of the current 
invention. 

For the preferred embodiment, DNA is prepared from a library 
of clones (either genomic clones or cDNA clones) - with inserts derived 

:o from the individual(s) and propagated in some appropriate host and cloning 
vector system. Restriction enzyme fragmentation is used prior to cloning 
and polymerase chain reaction amplification is used to prepare the DNA for 
comparison in fragmented form. Priming sites within the vector sequence 
flanking the cloned restriction enzyme fragmented inserts are employed for 

25 one or more cycles of polymerase chain reaction amplification of the 
fragmented DNA of interest. The primers used for polymerase chain 
reaction amplification of the fragmented DNA of interest are again used 
after the phenotype-determining fragment enrichment process to 'rescue' 
and clone the enriched fragments. Cloned enriched fragments are colony 

30 purified, picked into appropriate storage containers, catalogued and 
archived, DNA probes are prepared from these single clones and are 



WO 00/55364 



PCT/GB00/009I6 



-78 - 

individually hybridised against 180 spot TRSPA arrays as above. TRSPA 
signatures are determined for many colonies. Most noise fragments will be 
random in nature and will thus be randomly distributed amongst the 6 20 = 
3.7-1 0 15 possible types of TRSPA signature. Phenotype-determining signal 
5 fragments, however, will be those where repeat TRSPA signatures are 
obtained as more and more colonies are sampled. The more frequently the 
repetition of a given TRSPA signature occurs per unit colonies sampled, 
the greater the signal to noise ratio and the more successful has been the 
enrichment. 

in A great many of the steps in this process are amenable to 

high throughput automation - enabling very iarge numbers of single colony 
TRSPA signatures to be determined with ease and extending the power of 
the current invention to cases where signal to noise ratios are beyond 
current approaches. 

15 Statistical correlations (associations) can initially be drawn 

between TRSPA signatures and phenotype. The clones giving rise to a 
particular TRSPA signature showing a useful association with a phenotype 
of interest can then be sequenced in order to determine at a DNA 
sequence level the association(s) with the phenotype of interest. Such 

:o associations have future predictive values for the phenotype of interest, 
knowing the genotype and will be of great value in medicine and 
pharmacogenetics. 

If the genome of interest is wholly or partially sequenced, we 
can also in silico restrict the DNA with all n enzymes, calculate the 

25 expected signature for each fragment and pattern match these expected 
signatures with the observed signature (taking into account any loss or gain 
of restriction sites due to polymorphic variation compared to the reference 
sequence) to immediately identify the fragment of interest within a gene, 
genomic region, chromosome or whole genome. This latter method will be 

30 of great value in those cases where a great many phenotype-determining 
fragments are obtained and repeat signatures are rare or unobtained. The 
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clustering of phenotype-determining fragments to adjacent DNA regions 
thus gives an association between those genomic regions and the 
phenotype of interest. 

5 Aspect (IV) 

in yet another aspect the invention provides a double- 
stranded DNA molecule having the sequence a-A-b-B...X-y-Y-z where A t 
B...X and Y are unique restriction sites for n different restriction 
endonuclease enzymes, and a, b...y f z denotes distances in base pairs, 
10 characterised in that each fragment, obtainable by cutting the DNA 

molecule by means of any one or more up to n of the restriction enzymes, 
has a different length from every other fragment. 

An example totally diagnostic internal control DNA which allows both 
15 the extent and exact nature of any example set 1 (or example set 2) 6 
bp cutter partial digestion to be unambiguously determined for inter- 
population perfectly matched duplex depletion or TRSPA restriction 

In both of the above schemes, it is important that limit 
digestion products are obtained. Monitoring the extent of partial digestion 

20 resulting from multi-enzymatic restriction and determining precisely which 
enzymes have failed to cut is a task of great importance. 

If we have up to six enzymes for DNA digestion - let us label 
these A, B, C, D, E and F. We need to somehow determine that these 
have all cut to completion during the fragmentation stage for inter- 

25 population perfectly matched duplex depletion and also for the digestion 
step prior to adaptor ligation in TRSPA fabrication. If any of the enzymes 
have failed to cut to completion, we need to know which ones and to what 
degree in order to effectively rectify the problem. 

30 The structure of the internal control DNA 

If we construct a double stranded DNA molecule with the 
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following structure 

end— t—A~u--B--v— C--W— D~x™E— y—F— z— end 

5 where the A, B, C, D, E and F denote the sites for restriction 

enzyme cutting and t, u, v, w, x, y and z denote distances in base pairs. 

This internal control DNA is either uniformly pre-labelled and 
added to the DNA of interest at an appropriate concentration prior to 
restriction or is Southern blot probed with a complementary sequence not 
10 found in the DNA of interest after restriction. 

All six enzymes can cut in only one way. 
One enzyme can fail to cut in 6 Ci = 6 ways, these are: A, B, 
C, D, E or F failing to cut. 

Two enzymes can fail to cut in 6 C 2 = 15 ways, these are: AB, 
15 AC, AD, AE, AF, BC, BD, BE, BF, CD, CE, CF, DE, DF or EF failing to cut 
Three enzymes can fail to cut in 6 C 3 = 20 ways, these are: 
ABC, ABD, ABE, ABF, ACD, ACE, ACF, ADE f ADF t AEF, BCD, BCE, BCF, 
BDE, BDF, BEF, CDE, CDF, CEF or DEF failing to cut. 

Four enzymes can fail to cut in 6 C 4 = 15 ways, these are: 
20 ABCD, ABCE, ABCF, ABDE, ABDF, ABEF, ACDE, ACDF, ACEF t ADEF, 
BCDE, BCDF, BCEF, BDEF or CDEF failing to cut 

Five enzymes can fail to cut in 6 C 5 = 6 ways, these are: 
ABCDE, ABCDF, ABCEF, ABDEF, ACDEF or BCDEF failing to cut. 
AH six enzymes can fail to cut in only one way. 
25 Each of the above possibilities will generate one or more 

fragments from the interna! control DNA. If each possible fragment has a 
discernible size from any other, then we can determine exactly which 
enzymes have cut and which have not from the size distribution of the 
fragments generated. The task is therefore to design such a DNA 
30 molecule. 
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Example simulations 

Seven simulations are given below * varying the size of inter- 
site fragments. Criteria for a successful outcome include the following 

1 . The inter-fragment spacing should be greater for larger 
5 fragments (so as to aid electrophoretic resolution). 

2. All possible fragments should be unambiguously resolvable in 
size from each-other. 

3. Size gaps between bands comprising different numbers of 
inter-site units should be greater than the size gaps between bands 

io comprising the same number of inter-site units. 

4. The size gaps and size spread from largest to smallest 
fragment should be electrophoretically compatible. 

Simulation 1 

I5 

Inter-site fragment sizes (in bp) 



End—A 


A-B 


B--C 


C— D 


D~E 


E--F 


F— end 


80 


90 


100 


110 


120 


130 


140 
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Possible digestion products obtained (in bp) 





Fragment length 
(in bp) 


size gap to the next 
smallest fragment 
{in bp) 


one unit fragments 






end— A 


OA 

oU 




A ED 

A — D 


fin 




B — C 


100 


1 A 
f U 


C—D 


110 


10 


D--E 


120 


10 


E— F 


130 


10 


F— end 


140 


10 


two unit fragments 






end— B 


170 


30 


A— C 


190 


20 


B— D 


210 


20 


C— E 


230 


20 


D---F 


250 


20 


E— end 


270 


20 


three unit fragments 






end— C 


270 


0 


A— D 


300 


30 


B---E 


330 


30 


C— F 


360 


30 


D---end 


390 


30 


four unit fragments 






end— D 


r 380 


-10 


A— E 


420 


40 


B— F 


460 


40 


C — end 


ouu 




five unit fragments 






end— E 


500 


0 


A— F 


550 


50 


B— end 


600 


50 


six unit fragments 






end— F 


630 


30 


B— end 


690 


60 


seven unit fragments 






end— end 


770 


80 
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Spread = 690 bp 



Simulation 2 



Inter-site fragment sizes (in bp) 



end— A 


A---B 


B— C 


C— D 


D-E 


E— F 


F— end 


90 


100 


110 


120 


130 


140 


150 



Possible digestion products obtained (in bp) 



1 t 



WO 00/55364 



84- 



PCT/GBOO/00916 



fl 


agment length s 
(m bp) 


ize gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end — A 


90 




A—B 


100 


10 


B— C 


110 


10 




120 


10 


n F 


130 


10 


P— F 


140 


10 


C.pnri 


150 


10 


TWO Ullll l f ayi i let iw 






onH— R 
fcJHU " D 


190 


40 


M O 


210 


20 


D U 


230 


20 


P— F 


250 


20 


n f 


270 


20 


E — end 


290 


20 


mree unn nayitivnu? 








300 


10 


M — L/ 


330 


30 


o _c 

D U 


360 


30 


P F 


390 


30 


D — end 


420 


30 


four unit uagmenus 






end — D 


420 


0 


A F 


460 


40 


R— F 


500 


40 


C — end 


540 


40 


five unit fragments 






end— E 


550 


10 


A— F 


600 


50 


B— end 


650 


50 


six unit fragments 






end— F 


690 


40 


B— end 


750 


60 


seven unit fragment 


S 




end — end 


840 


90 



Spread = 750 bp 



"id i i it 
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Simulation 3 

Inter-site fragment sizes (In bp) 



end— A 


A-B 


B-C 


C-D 


D-E 


E— F 


F— end 


100 


110 


120 
. 


130 


140 


150 


160 



5 

Possible digestion products obtained (in bp) 
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■ T77 


_ Arrt Ion nth SI 

agmeru lei sy lh ^ 

fin hn^ £ 


ze gap to the next 
,mailest tragment 
(in bp) 


"oneunit fragments 






end— A 


i no. 




A— B 


1 in 


10 


B— C 


190 


10 


C— D 


1 ^n 


10 


D— E 


1 AO 


10 


E — F 


t 3VJ 


10 


F— end 


1 AH 
I DU 


10 


two unit fragments 






end— B 




50 


A— C 


Tan 


20 


B — D 




20 


C--E 


£/V 


20 


D — F 




20 


E— end 


n i n- 


20 


three unit fragments 






end— C 


3oU 


20 


A — D 


360 


30 


B— E 


^yu 


30 


C-F 




30 


D— end 


>1 CA 


30 


four unit fragments 






end — D 


40U 


10 


A— E 


ouu 


40 


B— F 


t;4n 


40 


C— end 


con 


40 


five unit fraqments 






end— E 


600 


20 


A-F 


650 


50 


B— end 


700 


50 


j six unit fragments 






end—F 


750 


50 


B— end 


810 


60 


seven unit f ragmenl 


s 




end— end 


910 


100 



Spread = 810 bp 
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Simulation 4 

Inter-site fragment sizes (in bp) 



end— A 


A— B 


B— C 


C— D I D--E 


E--F 


F— end 


110 


120 


130 


140 150 


160 


170 
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Possible digestion products obtained (in bp) 
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ragmenx lengtn < 

/in hn\ 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end— A 


1 in 




A— B 


1 C.\J 


10 


B— C 




10 


C— D 


1 AC\ 


10 


D— £ 




10 


E— F 


-t en 
1 OU 


10 


F— end 


l/U 


10 


two unit fragments 






end— B 


230 


fin 


A— C 




20 


B— D 


2/U 


20 


C— E 




20 


D— F 


310 


20 


E— end 


3ou 




three unit fragments 






end— C 


360 


30 


A— D 


390 


^n 


B— E 


420 


30 


C— F 


450 


^n 


D— end 


480 




four unit fragments 






end— D 


500 


90 
cXf 


A— E 


540 


AO 


B-F 


580 


*fv 


C— end 


620 










end — E 


650 


30 


A-F 


700 


50 


B — end 


750 


50 


six unit fragments 






end— F 


810 


60 


B — end 


870 


60 


seven unit fragments 






end— end 


980 


110 



Spread = 870 bp 



hi i 
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Simulation 5 

Inter-site fragment sizes (in bp) 



end— A 


A~B 


B-C 


C---D 


D-E 


E— F 


F—end 


120 


130 


140 


150 


160 


170 


180 



5 

Possible digestion products obtained (in bp) 
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fragment length 
(m bp) 


size gap to the next 
smallest fragment 
(tn bp) 


one unEt fragments 






end—A 


120 




A— B 


130 


1 0 


B— C 


140 


1 0 


C— D 


150 


10 


D — E 


160 


10 


E---F 


170 


10 


F— end 


180 


10 


two unit fragments 






end— B 


250 


70 


A— C 


270 


20 


B— D 


290 


20 


C-E 


310 


20 


D-F 


330 


20 


E— end 


350 


20 


three unit fragments 






end— C 


390 


40 


A— D 


420 


30 


B— E 


450 


30 


C-F 


480 


30 


D— end 


510 


30 


tour unit fragments 






end— D 


540 


30 


A— E 


580 


40 


B— F 


620 


40 


C— end 


660 


40 


tive unit fragments 






end— E 


700 


40 


A-F 


750 


50 


B— end 


800 


50 


six unit fragments 






end— F 


870 


70 


B— end 


930 


60 


seven unit fragments 






end— end 


1050 


120 



Spread = 930 bp 
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Simulation 6 

Inter-site fragment sizes (in bp) 



end— A 


A— B 


B-C 


C-D j D— E 


E— F 


F— end 


130 


140 


150 


160 170 


180 


190 



Possible digestion products obtained {in bp) 



PCT/GBOO/00916 

WO 00/55364 

-92- 





agment length s 
{in bp) 


ize gap to the next 
smallest fragment 
{in bp) 


one unit fragments 






end — A 


130 




A— B 


140 


10 


B— C 


150 


10 


C— D 


160 


10 


D— E 


170 


10 


F — F 
l_ — r 


180 


10 


f end 


190 


10 


TWO Ut ill siciyiiiviiw 






end— B 


270 


80 


A c. 

r\ w 


290 


20 


R n 


310 


20 


O C 


330 


20 


n— F 


350 


20 


£ end 


370 


20 


three unn iidyincmo 






cilu* — w 


420 


50 


a n 


450 


30 


n p 

d — ni 


480 


30 




510 


30 


D**~end 


540 


30 


tour unit Tragmeru& 






eno — u 


580 


40 


A F 


620 


40 


R p 


660 


40 


C---end 


700 


40 


five unit fragments 






end— E 


750 


50 


A-F 


800 


50 


B— end 


850 


50 


six unit fragments 






end— F 


930 


80 


B— end 


990 


60 


seven unit fragment 


s 




end — end 


1120 


130 



Spread = 990 bp 
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Simulation 7 

Inter-site fragment sizes (in bp) 



end — A 


A-B | 8-C 


C--D 


D-E 


E— F 


F— end 


140 


150 J 160 


170 


180 


190 


200 



5 

Possible digestion products obtained (in bp) 



IIP] I ' II M i' 
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fr 


agment length s 
(in bp) 


ze gap to the next I 
smallest fragment 
(in bp) 


one unit fragments 






end — A 


140 




A---B 


150 


10 


8-C 


160 


10 


C-D 


170 


10 


D— e 


180 


10 


E-F 


190 


10 


F— end 


200 


10 


two unit fragments 






end— B 


290 


90 


A-C 


310 


20 


B-D 


330 


20 


C-E 


350 


20 


D-F 


370 


20 


E— end 


390 


20 


three unit fragments 






end — C 


450 


60 


A— D 


480 


30 


a— e 


510 


30 


C— F 


540 


30 


D— end 


570 


30 


four unit fragments 






end — D 


620 


50 


A-E 


660 


40 


B-F 


700 


40 


C— end 


740 


40 


five unit fragments 






end— E 


800 


60 


A-F 


850 


50 


B— end 


900 


50 


"7ix unit fragments 






end— F 


990 


90 


B— end 


1050 


60 


seven unit fragment 


s 




end — end 


1190 ) ^ 



WO 00/55364 



PCT/GB00/00916 



-95- 

According to the above criteria for success, simulation 7 
above clearly fulfils all of the requirements. 

An example totally diagnostic internal control DNA which allows both 
5 the extent and exact nature of any example set 1 (or example set 2) 4 
bp cutter partial digestion to be unambiguously determined for inter- 
population perfectly matched duplex depletion 

If we have up to three enzymes for DNA digestion - let us 
label these A, B and C. We need to somehow determine that these have 
io all cut to completion during the fragmentation stage for inter-popuiation 
perfectly matched duplex depletion. If any of the enzymes have failed to 
cut to completion, we need to know which ones and to what degree in order 
to effectively rectify the problem. 

is The structure of the internal control DNA 

If we construct a double stranded DNA molecule with the 
following structure 

end— t— A—u— B— v— C— w-- end 

20 

where the A, B and C denote the sites for restriction enzyme 
cutting and t, u } v and w denote distances in base pairs. 

This internal control DNA is uniformly pre-labeiled and added 
to the DNA of interest at an appropriate concentration prior to restriction or 
25 is Southern blot probed with a complementary sequence not found in the 
DNA of interest after restriction. 

All three enzymes can cut in only one way. 

One enzyme can fail to cut in 3 Ci = 3 ways, these are: A, B or 
C failing to cut. 

30 Two enzymes can fail to cut in 3 Cz = 3 ways, these are: AB, 

AC or BC failing to cut. 
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All three enzymes can fail to cut in only one way. 

Each of the above possibilities will generate one or more 
fragments from the internal control DNA. If each possible fragment has a 
discernible size from any other (and from any of the fragments in simulation 
5 7 above for the panel of up to 6 enzymes), then we can determine exactly 
which enzymes have cut and which have not from the size distribution of 
the fragments generated. The task is therefore to design such a DNA 
molecule. 

10 Example simulations 

Six simulations are given below - varying the size of inter-site 
fragments. Criteria for a successful outcome included the following: 
1 . The inter-fragment spacing should be greater for larger 

fragments (so as to aid eiectrophoretic resolution). 
15 2. All possible fragments should be unambiguously resolvable in 

size from each-other. 

3. Size gaps between bands comprising different numbers of 

inter-site units should ideally be greater than the size gaps between bands 
comprising the same number of inter-site units. 
20 4. The size gaps and size spread from largest to smallest 

fragment should be eiectrophoretically compatible. 
5. The largest fragment obtained should ideally be smaller than 

the smallest fragment obtained in simulation 7 above for the panel of up to 
six enzymes. 
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Simulation 1 

Inter-site fragment sizes (in bp) 



end— A 


A— 8 


B-C 


C— end 


10 


20 


30 


40 



5 

Possible digestion products obtained (in bp) 







fragment length 


size gap to the next 






{in bp) 


smallest fragment 








(in bp) 




one unit fragments 








end— A 


10 






A— B 


20 


10 




B~C 


30 


10 




C— end 


40 


10 




two umt fragments 








end — B 


30 


-10 




A— C 


50 


20 




B— end 


70 


20 




three unit fragments 








end— C 


60 


-10 




A— end 


90 


30 




four unit fragments 








end— end 


100 


10 



Spread = 90 bp 

10 

Simulation 2 

Inter-site fragment sizes (in bp) 



end — A 


A-B 


B— C 


C—end 


15 


25 


35 


45 



15 



iliir 
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Possible digestion products ohtained (in bp) 





fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end — A 


15 




A— B 


25 


10 


8~C 


35 


10 


C— end 


45 


10 


two unit fragments 






end — B 


40 


-5 


A-C 


60 


20 


B— end 


80 


20 


three unit fragments 






end— C 


75 


-5 


A— end 


105 


30 


four unit fragments 






end— end 


120 


15 



Spread = 105 bp 

5 

Simulation 3 

Inter-site fragment sizes (in bp) 



end— A 


A--B 


B-C 


C— end 


20 


30| 


40 


50 



10 
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Possible digestion products obtained (in bp) 





fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end—A 


20 




A— B 


30 


10 


B--C 


40 


10 


C— end 


50 


10 


two unit fragments 






end— B 


50 


0 


A— C 


70 


20 


B— end 


90 


20 


three unit fragments 






end— C 


90 


0 


A — end 


120 


30 


tour unit fragments 






end—end 


140 ) W 



Spread = 120 bp 

5 

Simulation 4 

Inter-site fragment sizes (in bp) 



end— A 


A— B 


B-C 


C— end 


25 


35 


45 


55 



to 
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Possible digestion products obtained (in bp) 





fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end— A 


25 




A— B 


35 


10 


8— C 


45 


10 


C— end 


55 


10 


two unit fragments 






end— B 


60 


5 


A—C 


80 


20 


3— end 


100 


20 


three unit fragments 






end— C 


105 


5 


A—end 


135 


30 


four unit fragments 






end— end 


160 


25 • 



Spread = 135 bp 

5 

Simulation 5 

Inter-site fragment sizes (in bp) 



end — A 


A— B 


B— C 


C— end 


20 


25 


30 


35 
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Possible digestion products obtained (in bp) 





fragment length 
(in bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






end— A 


20 




A— B 


25 


5 


B— C 


30 


5 


C— end 


35 


5 


two unit fragments 






end— B 


45 


10 


A— C 


55 


10 


B—end 


65 


10 


three unit fragments 






end— C 


75 


10 


A— end 


90 


15 


four unit fragments 






end— end 


110 


20 



Spread = 90 bp 

5 

Simulation 6 

Inter-site fragment sizes (in bp) 



end — A 


A— B 


8— C 


C«-end 


25 


30 


35 


40 
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Possible digestion products obtained (in bp) 





fragment length 
un bp) 


size gap to the next 
smallest fragment 
(in bp) 


one unit fragments 






ena—A 


25 




A— 8 


30 


5 


B— C 


35 


5 


C— end 


40 


5 


two unit fragments 






end—B 


55 


15 


A— C 


65 


10 


B—end 


75 


10 


three unit fragments 






end—C 


90 


15 


A — end 


105 


15 


four unit fragments 






end— end 


130 


25 



Spread = 1 05 bp 

5 

According to the above criteria for success, simulation 6 
clearly fulfils all of the requirements. 

Example determination of the entire set of internal control DNA limit 
io and partial digestion patterns for a panel of up to six restriction 
enzymes 

For the example simulation 7, the entire set of internal control 
DNA limit and partial digestion patterns for a panel of up to six restriction 
enzymes can be determined as below. 
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Inter-site fragment sizes (in bp) 



end—A 


A—B 


B-»G 


C™0 


D— £ 


E-F 


F— end 


140 


150 


160 


170 


180 


190 


200 



Possible digestion products obtained 

5 All six enzymes can cut in only one way. 



failed digests 


bp 


bp I bp 


bp 


bp 


bp 


bp 


none 


140 


150 160 


170 


180 


190 


200 



One enzyme can fail to cut in 6 Ci = 6 ways, these are: A, B, 
C, D r E or F failing to cut. 

10 



failed digest 


bp 


bp 


bp 


bp 


bp 


bp 


A 


290 


160 


170 


180 


190 


200 


B 


140 


310 


170 


180 


190 


200 


C 


140 


150 


330 


180 


190 


200 


D 


140 


150 


160 


350 


190 


200 


E 


140 


150 


160 


170 


370 


200 


F 


140 


150 


160 


170 


180 


390 
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Two enzymes can fail to cut in 6 C 2 = 15 ways, these are: AB, 
AC, AD. AE, AF, BC f BD. BE. BF. CD, CE f CF. DE, DF or EF failing to cut. 



failed digests 


bp 


bp 


op 


bp 


bp 


AB 


450 


170 


180 


190 


200 


AC 


290 


330 


180 


190 


200 


AD 


290 


160 


350 


190 


200 


AE 


290 


160 


170 


370 


200 


AF 


290 


160 


170 


180 


390 


BC 


140 


480 


180 


190 


200 


BD ■ 


140 


310 


350 


190 


200 


BE 


140 


310 


170 


370 


200 


BF 


140 


310 


170 


180 


390 


CD 


140 


150 


510 


190 


200 


CE 


140 


150 


330 


370 


200 


CF 


140 


150 


330 


180 


390 


DE 


140 


150 


160 


540 


200 


DF 


140 


150 


160 


350 


390 


EF 


140 


150 


160 


170 


570 
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Three enzymes can fail to cut in 5 C 3 = 20 ways, these are: 
ABC. ABD. ABE. ABF, ACD, ACE, ACF. ADE. ADF, AEF, BCD. BCE, BCF, 
BDE, BDF, BEF. CDE. CDF, CEF or DEF failing to cut. 



failed digests 


bp 


bp 


bp 


bp 


ABC 


620 


180 


190 


200 


ABD 


450 


350 


190 


200 


ABE 


450 


170 


370 


200 


ABF 


450 


170 


180 


390 


ACD 


290 


510 


190 


200 


ACE 


290 


330 


370 


200 


ACF 


290 


330 


180 


390 


ADE 


290 


160 


540 


200 


ADF 


290 


160 


350 


390 


AEF 


290 


160 


170 


570 


BCD 


140 


660 


190 


200 


BCE 


140 


480 


370 


200 


BCF 


140 


480 


180 


390 


BDE 


140 


310 


540 


200 


BDF 


140 


310 


350 


390 


BEF 


140 


310 


170 


570 


CDE 


140 


150 


700 


200 


CDF 


140 


150 


510 


390 


CEF 


140 


150 


330 


570 


DEF 


140 


150 


160 


740 



5 
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Four enzymes can fail to cut in 5 C 4 = 15 ways, these are: 
ABCD. ABCE, ABCF, ABDE, ABDF. ABEF, ACDE, ACDF, ACEF. ADEF, 
BCDE. BCDF, BCEF. BDEF or CDEF failing to cut. 



ICttlCU UiyCoiO 


bn 


bD 


bo 






1 90 


200 






?7fl 
o / u 


200 


ABCF 


620 


180 


390 


ABDE 


450 


540 


200 


ABDF 


450 


350 


390 


ABEF 


450 


170 


570 


ACDE 


290 


700 


200 


ACDF 


290 


510 


390 


ACEF 


290 


330 


570 


ADEF 


290 


160 


740 


BCDE 


140 


850 


200 


BCDF 


140 


660 


390 


BCEF 


140 


480 


570 


BDEF 


140 


310 


740 


CDEF 


140 


150 


900 



5 
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Five enzymes can fail to cut in 6 Cs = 6 ways, these are: 
ABCDE ; ABCDF, ABCEF, ABDEF, ACDEF or BCDEF failing to cut. 



failed digests 


bp 


bp 


ABCDE 


990 


200 


ABCDF 


800 


390 


ABCEF 


620 


570 


A8DEF 


450 


740 


ACDEF 


290 


9Q0 


BCDEF 


140 


1050 



5 Ail six enzymes can fail to cut in only one way. 



failed digests 


bp 


all 


t!90 



Example determination of the entire set of internal control DNA limit 
and partial digestion patterns for a pane! of up to three restriction 
io enzymes 

For the example simulation 6, the entire set of internal control 
DNA limit and partial digestion patterns for a panel of up to three restriction 
enzymes can be determined as below. 

15 Inter-site fragment sizes (in bp) 



end—A 


A-B 


B-C 


C— end 


25 


30 


35 


40 



Possible digestion products obtained 

All three enzymes can cut in only one way. 

20 



failed digests 


bp 


bp 


bp 


bp 


none 


25 


30 


35 


40 
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One enzyme can fail to cut in 3 Ci = 3 ways, these are; A, B or 
C failing to cut. 



failed digests 


bp 


bp 


bp 


A 


55 


35 


40 


S 


25 


65 


40 


C 


25 


30 


75 



5 Two enzymes can fail to cut in 3 C2 = 3 ways, these are; AB, 

AC or BC failing to cut. 



failed digests 


bp 


bp 


AB 


90 


40 


AC 


55 


75 


8C 


25 


105 



All three enzymes can fail to cut in oniy one way. 

JO 



failed digests 


bp 


ail 


130 
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Example 1a - The digestion internal control plasmid for the 6 bp 
cutter set 1 TRSPA enzymes BamHl, BsrGl, Hindllt, Ncoi, SpeL and 
A fill 

The plasmid pNW33 (shown below) was constructed to 
contain an insert with all of the 6 bp cutter TRSPA enzyme sites. 



Hindi ft (445) 

BspEI (466) 

14Q'bp\^BamH( (605) 

15Qbp\BstGf(755) 

160 bp\ Htndtfi (915) 

170 b P\Mcoi(lQ85) 

180 bp\ 

H 4- Spe/ (1266) 
190 bp 

r ± AMI (1455) 
200 bp 

J BspEi(1656) 



pNW33 (3954 bp) 
(not drawn to scale) 



Hmdfft (1686) 
^Hindifi (1715) 



BspEI sites define the outer ends of the 140 bp and the 200 
io bp fragments. The full sequence for pNW33 is shown below: 



tcgcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccggagacggtcacagcttgtctgtaagcggatgcc 
gggagcagacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggctggcttaactatgcggcatcagagca 
gattgtactgagagtgcaccatatgcggtgtgaaataccgcacagatgcgtaaggagaaaataccgcatcaggcgccattc 

15 gccattcaggctgcgcaactgttgggaagggcgatcggtgcgggcctcttcgctattacgccagctggcgaaagggggatgt 
gctgcaaggcgattaagttgggtaacgccagggttttcccagtcacgacgttgtaaaacgacggccagtgaattcgagctcgg 
taccgggccccccctcgaggtcgacggtatcgataagcttgatcgcagctggtaatccggacgcccgcgtcgaagatgttgg 
ggtgttgtaacaatatcgattccaattcagcgggggccacctgatatcctttgtatttaattaaagacttcaagcggtcaactatga 
agaagtgttcgtcttcgtcccagtaaggatccgcactttgaattttgtaatcctgaagggatcgtaaaaacagctcttcttcaaatct 

20 atacattaagacgactcgaaatccacatatcaaatatccgagtgtagtaaacattccaaaaccgtgatggaatggaacaaca 
cttaaatgtacaccctggtaatccgttttagaatccatgataataattttctggattattggtaattttttttgcacgttcaaaattttttgca 
acccctttttggaaacaaacactacggtaggctgcgaaatgttcatactgttgagcaattcacgttcattataagcttttcactgcat 
acgacgattctgtgatttgtattcagcccatatcgtttcatagcttctgccaaccgaacggacatttcgaagtattccgcgtacgtg 
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atgttcacctcgatatgtgcatcigtaaaagcaattgttccaggaaccagggcgtatctcttcatagccatggaatacgcctttttc 
agtgttgcgatgctaatgccgttacaaatattccgagcaccaagaatggctgcgcgcttgcctggtacttgacgtcgtatttgacg 
gggtccttgagaaagtatttaaactggaacacaatctgaggaatgatcaaagcaaccaacgccaacgcataataactagtg 
caaiaccaagacctcccaataatagcacccagacttgtgtaataacctctggctctgatattgctccagatggaattggacgat 
5 atggctcattaattgcgicgatatctctatcataccagtcgttgattgtctgtgtatagccagtaagacaaggaccagacatcatca 
tgcaaagaatcgcttaagcccttcttggcctttatgaggatctctctgatttttcttgcgtcgagttttccggtaagacctttcggtactt 
cgtccacaaacacaactcctccgcgcaactttttcgcggttgttacttgactggccacgtaatccacgatctctttttccgtcatcgt 
ctttccgtgctccaaaacaacaacggcggcgggtccggattaccagctgcgatcaagcttatcgataccgtcgacctcgacct 
gcaggcatgcaagcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccgctcacaattccacacaacatacga 

iO gccggaagcataaagtgiaaagcctggggtgcctaatgagtgagctaactcacattaattgcgttgcgctcactgcccgctttc 
cagtcgggaaacctgtcgtgccagctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtaitgggcgctcttc 
cgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggtt 
atccacagaatcaggggataacgcaggaaagaacatatgagcaaaaggccagcaaaaggccaggaaccgtaaaaag 
gccgcgttgctggcgttmccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaa 

1 5 acccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccg 
gatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcg 
ctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaaccc 
ggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacag 
agttcttgaagtggtggcctaactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttaccttcgga 

20 aaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgc 
agaaaaaaaggatctcaagaagatcctttgatcttttctacggggtcigacgctcagtggaacgaaaactcacgttaagggatt 
ttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagt 
aaacttggtctga'cagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgact 
ccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctca 

25 ccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctgcaaclttatccgcctcc 
atccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggc 
atcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgt 
gcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaagttggccgcagtgttatcactcatggttatggcagc 
actgcataattctcttactgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattctgagaafagtgta 

30 tgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatcatt 
ggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaac 
tgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataag 
ggcgacacggaaatgttgaatactcatactcttcctttttcaatattattgaagcattiatcagggttattgtctcatgagcggataca 
tatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagtgccacctgacgtctaagaaacc 

35 attattatcatgacattaacctataaaaaiaggcgtatcacgaggccctttcgtc 



'IllUt f '[((("ri 
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The insert was introduced as a four fragment ligation of 140 
(Kpnl-BamHI), 1 50 / 1 60 (BamHI-Hindill), 1 70 / 1 80 (Hindl)i-Spei) and 
190 / 200 (Spel-Xhol) into Kpnl / Sail digested pUC19c DNA (Genbank 
X02514). The Sail and Xhol sites were lost as a result of the joining of their 

5 compatible sticky ends. The insert can be removed with BspEI or Pvull. 

The insert region of 1 190 bp and the short flanking regions to 
the vector junctions were sequenced twice in each direction in order to 
establish the piasmid sequence, in addition, a total of 63 analytical 
restriction digests and one minus-enzyme control were performed as 

!0 detailed in the following table: 







Expected band sizes 


Digest 


Weil 


bp 


bp 


bp 


bp 


bp 


bp 


bp 


Uncut 


1 


1190 














BamHi 


2 


140 


1050 












BsrGI 


3 


290 


900 












Hindfif 


4 


450 


740 












Ncoi 


5 


620 


570 . 












Spei 


6 


800 


390 












Aflli 


7 


990 


200 












8amHi BsrGI 


8 


140 


150 


900 










BamHI Hindtli 


9 


140 


310 


740 










BamHi Ncoi 


10 


140 


480 


570 










BamHi Spet 


11 


140 


660 


390 










BamHi Ami 


12 


140 


850 


200 










BsrGI Htndlit 


13 


290 


160 


740 










BsrGI Ncoi 


14 


290 


330 


570 










BsrGI Spei 


15 


290 


510 


390 










BsrGI Afiii 


16 


290 


700 


200 


— I 








Hindlll Ncoi 


17 


450 


170 


570 










Hmam Spet 


18 


450 


350 


390 










Hindlll AM 


19 


450 


540 


200 










Ncot Spel 


20 


620 


180 


390 










Ncoi Aflli 


21 


620 


370 


200 










Spet Aflli 


22 


800 


190 


200 










BamHi BsrGI Hmdlil 


23 


140 


150 


160 


740 








BamHi BsrGI Ncoi 


24 


140 


150 


330 


570 








BamHi BsrGi Spel 


25 


140 


150 


510 


390 








BamHt BsrGI Aflii 


26 


140 


150 


700 


200 









MJ)IJ II "MM ' i r 
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BamHI Hmdiii Ncol 


27 


| 140 


310 


170 


570 








BamHi Hmdiii Spei 


28 


140 


310 


350 


390 








SamHI Hmdiii Aiill 


29 


140 


310 


540 


200 








BamHi Ncol Spei 


30 


140 


480 


180 


390 








BamHI Ncoi Aflif 


31 


140 


480 


370 


200 








BamHi Spei AM 


32 


140 


660 


190 


200 








BsrGl Hindi!} Ncoi 


33 


290 


160 


170 


570 








BsrGl Hmdlli Spei 


34 


290 


160 


350 


390 








BsrGi hmdiii Aftll 


35 


290 


160 


540 


200 








BsrGi Ncoi Spei 


36 


290 


330 


tec 


390 








BsrGl Ncoi AM 


37 


290 


330 


370 


200 








BsrGI Spei Aflil 


38 


290 


510 


190 


200 








Hmdiii Ncol Spei 


39 


450 


170 


130 


390 








Hindlii Ncoi Affll 


40 


450 


170 


370 


200 








Hmdiii Spei Afili 


41 


450 


350 


190 


200 








Ncoi Spei Aflii 


42 


620 


180 


190 


200 








BamHI BsrGl Hmdiii Ncoi 


43 


140 


150 


160 


170 


570 






BamHi BsrGi Hmdlli Spei 


44 


140 


150 


160 


350 


390 






BamHi BsrGi Hmdiii AtUi 


45 


140 


150 


160 


540 


200 






BamHi BsrGl Ncoi Spei 


46 


140 


150 


330 


180 


390 






BamHi BsrGl Ncoi Atlll 


47 


140 


150 


330 


370 


200 






BamHi BsrGl Spei Afiii 


48 


140 


150 


510 


190 


200 






BamHi Hindttt Ncoi Spei 


49 


140 


310 


170 


180 


390 






BamHI Hindlfl Ncoi Afili 


50 


140 


310 


170 


370 


200 






BamHI Hmdiii Spei Afili 


51 


140 


310 


350 


190 


200 






BamHI Ncoi Spei Ami 


52 


140 


480 


180 


190 


200 






BsrGl Hmdlli Ncol Spei 


53 


290 


160 


170 


180 


390 






BsrGi Htndfti Ncol Aftlt 


54 


290 


160 


170 


370 


200 






BsrGl Hmdlli Spei Atlli 


55 


290 


160 


350 


190 


200 






BsrGl Ncoi Spei Afili 


56 


290 


330 


180 


190 


200 






Hmdiii Ncoi Spei AtUI 


57 


450 


170 


180 


190 


200 






BamHI BsrGl Hmdlli Ncol Spei 


58 


140 


150 


160 


170 


180 


390 




BamHi BsrGi Hmdiii Ncoi Afili 


59 


140 


150 


160 


170 


370 


200 




BamHi BsrGi Hmdlli Spei Atlll 


60 


140 


150 


160 


350 


190 


200 




BamHI BsrGl Ncoi Spei AW 


61 


140 


150 


330 


180 


190 


• 200 « 




BamHi Hmdiii Ncoi Spei Afili 


62 


140 


310 


170 


180 


190 


200 




BsrGl Hmdlli Ncoi Spei Aflll 


63 


290 


160 


170 


180 


190 


200 




BamHi BsrGi Hmdlli Ncoi Spei AMI 


64 


140 


150 


160 


170 


180 


190 


200 



All of these digests produced the expected fragment patterns 
on agarose gel electrophoresis and a number of these, shaded in the table 
above, are shown in figure 1. The restriction digests illustrated in figure 1 
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were carried out using the following conditions: 



Digest 1 {Minus enzyme control) 



10u1 


100 ng/mi internal control piasrrud DNA in 1x NEB butter #2+100 ug/mi BSA 


90 nl 


1x NEB buffer #2 + 100 ug/ml BSA 


Digest 5 


15 


0.33 U/ui Ncol in 1 x NEB buffer #2 + 1 00 ng/ml BSA 


10 jai 


100 ug/m{ internal control plasmid DNA in 1x NEB buffer #2 + 100 ug/ml BSA 


75 nl 


1x NEB buffer #2 + 100 ug/ml BSA 


Digest 9 


15 jii 


0.33 U/h( BamHhn 1x NEB buffer #2 + 100 ng/ml BSA 


15 \i\ 


0.33 U/hI Hindlll'm 1x NEB buffer #2 + 100 ng/ml BSA 


10 ji! 


100 ng/ml internal control plasmid DNA in 1x NEB buffer #2 + 100 ng/ml BSA 


60 \x\ 


1x NEB buffer #2+100 ng/mi BSA 


Digest 37 


15 \i\ 


0.33 U/ul SsrG/ in 1 x NEB buffer #2 + 1 00 ng/ml BSA 


15 fil 


0.33 U/hI Wco/ in 1x NEB buffer #2 + 100 ng/ml BSA 


15 Hi 


0,33 U/hI Aflll in 1x NEB buffer #2+100 ng/ml BSA 


10 Hi 


100 ng/ml internal control piasmid DNA in 1x NEB buffer #2 + 100 ng/mi BSA 


45 fil 


1x NEB buffer #2 + 100 ng/ml BSA 



15 
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Digest 49 



; 15 ill 


0.33 U/ui BamHI in 1 x NEB buffer #2 + 100 ug/mi BSA 


15 ul 


0.33 U/ui Hindfll in 1 x NEB buffer #2+100 ug/ml BSA 


I 15 ul 


0,33 U/ui Ncol in 1x NEB buffer #2+100 ug/ml BSA 


! 15 ul 


0.33 U/ui Spel in 1 x NEB buffer #2 + 100 ug/ml BSA 


! 10 u! 


■ 100 ug/ml internal control piasmid DNA in 1x NEB buffer #2 + 100 ug/ml BSA 


; 30 ui 


1x NEB buffer #2 + 100 ug/ml BSA 


Digest 61 


; is ui 


0.33 U/ui BamHI tn 1x NEB buffer #2 + 100 ug/ml BSA 


15 u! 


0.33 U/ui BsrG! in 1x NEB buffer #2 + 100 ug/m! BSA 


15 ui 


0.33 U/ui Ncol in 1 x NEB buffer #2+100 ug/mJ BSA 


15, ui 


0.33 U/ul Spe/ in 1x NEB buffer #2 + 100 ug/ml BSA 


15 ui 


0.33 U/ui A//// in 1x NEB buffer #2 + 100 ug/mi BSA 


10 ul 


100 ug/ml internal control plasmtd DNA in 1x NEB buffer #2 + 100 ug/ml BSA 


15 ui 


1x NEB buffer #2 + 100 ug/ml BSA 



All 64 restriction digests were incubated at 37°C for 6 hours 
and samples were then eiectrophoresed on 2.5 % FMC M eta P ho r agarose 
gels as illustrated in figure 1 . The lanes marked M contain mixed samples 
K) from all of the digests 1-64 and these lanes were used as size markers 
after confirmation of the fragment sizes against Stratagene Kb ladder 
markers. 

When spiked into a genomic DNA digest the internal control 
restriction fragment pattern produced is indicative of both the degree of 
15 digestion and the nature of any partial restriction at less than limit digestion. 
It is possible to deduce from the bands present which, if any, of the 
enzymes have failed to cut and therefore to take action to correct this 
before the DNA is used in a subsequent analysis or enrichment procedure. 
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Preparation of Bsp£7-re!eased internal control DNA on a large scale 

The method used for generating interna! control PCR product 
DNA (free from contaminating dNTPs) is depicted in the following figure: 



pNW33 



crude bio__ d Tp 

PCR orociuct | o»o 01 ^ 

SA- captured fr-SA-b.Q — + ^yp 

POP orotuct ^ o*Q-SA-# 

clean 0-SA-bio_ 



PCP oroduct _ + , toio-SAH 

3spEI | 
digest 



imemai 

control for RE digests 



Primers and PCR 



20(aM BIO140UP 

io 5' biotin-CGCAGCTGGTAATCCGGACGCCCGCGTCGAAGATGTT 3' 
20 \xM BIO200DOWN 

5 1 biotin-CGCAGCTGGTAATCCGGACCCGCCGCCGTTGTTGTT 3' 

is Bulk PCR amplification (192x 1 00 jul reactions) was carried 

out according to the conditions described below: 





per 20 mi 


final conditions 


100 mM dATP 


20 fii 


100mM 


100 mM dCTP 


20 JLLl 


100 uM 


100 mM dGTP 


20jil j 


100 pM 


100 mM dTTP 


20 |ul 


100uM 


10x PCR buffer 


2 ml 


1x 


25 mM MgCI 2 


400 til 


1/50th volume 


1 np/ul pNW33 


400 ui 


2 ng per PCR 


20 uM B1O140UP 


200 mJ 


20 pmot per PCR 


20 BIO200DOWN 


200 \x\ 


20 pmoi per PCR 


Tag DNA polymerase 


100 ul 


2.5 U per PCR 
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The master mix was rapidly dispensed into PCR tubes. 
Thermal cycling was initiated using the following parameters: 30 cycles of 
97 C C for 1 min, 50 C C for 2 min, and 72 C C for 3 min; 72°C for 5 min; and 
^ then 4°C. 

After PCR amplification, samples were pooled and subjected 
to capture of biotinylated PCR product termini and BspE] release of internal 
control DNA. 

io Capture of biotinylated PCR product termini and BspEl release of 
internal control DNA 

Alt separations were carried out using a Dynal MPC-1 
separator (Dynal, product #12001). 

20 ml of pooled PCR reaction were mixed with 20 ml of 
15 Dynabeads M-280 in 20 mM tris-HC! (pH 7.4), 2 mM EDTA (pH 8.0), 2 M 
NaCI. The tube was incubated at room temperature for 1 hour with rolling. 

The Dynabeads were then washed four times with 20 ml of 10 
mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 1 M NaCI. 

BspE! digestion was carried out for 1 hour at 37°C in 2 ml of 
:o 0.5 U/jil BspEl in 1x NEB buffer #3. The digest was then ethanol 

precipitated by the addition of 0.1 volume of 3 M sodium acetate (pH 5.2) 
and 2.5 volumes of ethanoi, chilling to -20°C and then centrifugation. 
Pellets were rinsed with 70 % ethanol prior to redissolution in 500 |il of 1x 
TE buffer. 

25 1 jiLi of the BspEl-released internal control DNA was mixed 

with 10 |il of 50 % glycerol AGE loading dye and electrophoresed on a 1.5 
% agarose gei to confirm that the size of the purified DNA was in 
accordance with that expected. 
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High molecular weight genomic DNA digestion in the presence of 
internal control DNA with a dilution series of a mixture of the six set 1 
6 bp cutters 

High molecular weight canine genomic DNA was first mixed 
? with a dilution series of a mixture of the six set 1 6 bp cutters - each at the 
same number of unte. Aliquots were then removed and mixed with the 
BspEI-released internal control DNA described above. 20 jug of canine 
genomic DNA was digested with 0.25, 0.025, 0,0025, 0.00025, 0.000025, 
0.0000025 and 0 U/ul BamH1 / BsrGI / Hindi!! / Ncol / Spel / Aflli in 200 jxi. 
lo 0.4 jag of canine genomic DNA and 1 jii of BspEI-released internal control 
DNA were digested with 0.25. 0.025, 0.0025, 0.00025, 0.000025, 
0.0000025 and 0 U/jxl BamH1 / BsrGI / Hindlil / Ncol / Spel / Aflll in 4 yd. 
Vistra Green staining was used to monitor the extent of internal control 
DNA digestion and high molecular weight DNA digestion. The fragment 
15 pattern from the internal control DNA is diagnostic of both the degree of 
digestion and the nature of any partial restriction at less than limit digestion. 

A premix of restriction enzymes, buffer and BSA was 
prepared as detailed below: 



component 


Hi 


100 U/jjU BamHl 


1 |ii 


10UAil BsrGI 


10 [il 


40 U/fil Hindlil 


2.5 |il 


50 UAil Ncol 


2 Hi 


50 U/hI Spel 


2 jil 


20 U/nl Aflll 


5 nl 


10x NEB buffer #2 


4 |il 


lOOx BSA 


0.4 |il 


water 


13. 1 ( ui 



20 

Totai volume = 40 \il 

The premix therefore contained each restriction enzyme at 
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2.5 U/|il in 1x NEB buffer #2 and 1x BSA. Serial 10-foid dilutions of this 
premix were then prepared in 1x NEB buffer #2 and 1x BSA. 

Premixes of canine genomic DNA, buffer and BSA were also 
prepared as detailed below: 



Component 


Ml 


1 mg/ml canine genomic DNA 


20 ul 


1Qx NEB buffer #2 


8ut 


100x BSA 


0.8 ul 


Water 


51.2ul 



These premixes therefore contain canine genomic DNA at 
0.25 mg/ml in 1x NEB buffer #2 and Ix BSA. 

Canine genomic DNA and restriction enzyme mixes were 
io then set up as follows: 



tube 


component #1 


component #2 


1 


80 ul of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 u( of each restriction enzyme at 2.5 
U/ui in 1x NEB buffer #2 and 1x BSA 


2 


80 ul of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 ui of each restriction enzyme at 0,25 
U/ul in 1x NEB buffer #2 and 1x BSA 


3 


80 ul of canine genomic DNA at 0.25 
mg/m! in 1x NEB buffer #2 and 1x BSA 


20 ul of each restriction enzyme at 0.025 
U/ul in 1x NEB buffer #2 and Ix BSA 


4 


80 ul of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 ul of each restriction enzyme at 
0.0025 U/ul in Ix NEB buffer #2 and 1x 
BSA 


5 


80 ul of canine genomic DNA at 0.25 
mg/mi in 1x NEB buffer #2 and 1x BSA 


20 ul of each restriction enzyme at 
0.00025 U/ui in ix NEB buffer #2 and 1x 
BSA 


6 


80 ui of canine genomic DNA at 0.25 
mg/ml m 1x NEB buffer #2 and 1x BSA 


20 ui of each restriction enzyme at 
0.000025 U/uf in Ix NEB buffer #2 and 
1x BSA 


7 


80 ul of canine genomic DNA at 0.25 
mg/ml in 1x NEB buffer #2 and 1x BSA 


20 ul of 1x NEB buffer #2 and 1x BSA 



miw v 
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2 liI aiiquots were then removed from tubes 1-7 and added to 

1 ,ui of BspEI-released internal control DNA and 1 ui of 2x NEB buffer #2 
and 2x BSA to give samples 1-7ic. Samples 1-7ic were then overlaid with 

5 50 jii of mineral oil in order to prevent evaporation. 

The remaining volume from tubes 1-7 was then added to 
100 jjiI of 1x NEB buffer #2 and 1x BSA. 

All samples were finally incubated at 37°C overnight. 

After digestion, 20 ui of digests 1-7 were mixed with 10 (il of 
i<> 50 % glycerol AGE loading dye and 4 ul of digests 1-7ic were mixed with 

2 |ui of 50 % glycerol AGE loading dye. Digests in loading dye were then 
electrophoresed on a 2.5 % MetaPhor agarose gel in 1x TBE. The gel was 
stained for 60 min in 500 ml of 1x TBE containing 50 \x\ of Vistra Green. 
The stained gel was finally imaged on a Fluorimager with the following 

15 settings: a 488 nm laser; a 570 DF 30 filter; a PMT setting of 700 V; 
200 pm resolution; and low sensitivity. 

Restriction digests - final conditions 

Restriction digests 1-7 were therefore performed under the 
20 following conditions: 



tube 


canine 
genomic 
DNA 


U 

BamH1 


u 

BsrGS 


U 

Hindltl 


u 

Ncol 


U 
Spel 


u 

Atm 


NEB 
buffer #2 


BSA 


1 


20 


50 


50 


50 


50 


50 


50 


IX 


100 


2 


20 


5 


5 


5 


5 


5 


5 


IX 


100 


3 


20 


0.5 


0.5 


0.5 


0.5 


0.5 


0.5 


IX 


100 


4 


20 


0.05 


0.05 


0.05 


0.05 


0.05 


0.05 


IX 


100 


5 


20 


0.005 


0.005 


0 005 


0.005 


0.005 


0.005 


IX 


100 


6 


20 


0.0005 


0.0005 


0.0005 


0.0005 


0.0005 


0.0005 


IX 


100 


7 


20 


0 


0 


0 


0 


0 


0 


1x 


100 



Total volume = 200 \i\ 
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Likewise. restriction digests 1-7ic were performed under the 
following conditions: 



tube 




ul 


U 


u 


u 


u 


u 


u 


NEB 


fxg/ml 




canine 


IC 


8amH1 


BsrGi 


Hinefflt 


Ncol 


Spe) 


Aftll 


: buffer 


BSA 




genomic 


DNA 














#2 






DNA 




















1ic 


0.4 


1 


1 


1 


1 


1 


1 


1 


1X 


100 


2ic 


0.4 


1 


0.1 


0.1 


0.1 


0.1 


0.1 


0.1 


1x 


100 


3ic 


04 


1 


0 01 


0 01 


0.01 


0.01 


0.01 


0.01 


1X 


100 


4>C 


04 




0.001 


0.001 


0.001 


0 001 


0.001 


0.001 


1x 


100 


5ic 


0.4 


1 


0.0001 


0.0001 


0 0001 


0.0001 


0.0001 


0.0001 


IX 


100 


6ic 


0 4 




0 O0001 


0.00001 


0.00001 


0 00O01 


0.00001 


0.00001 


1x 


100 


7ic 


0.4 




0 


0 


0 


0 


0 


0 


IX 


100 



IC = BspEl-released internal control DNA 

Total volume = 4 jil 

The results are shown in figure 2. 



WO 00/55364 



PCT/GB00/00916 



- 121 - 

Example lb - The digestion internal control ptasmid for the 4 bp 
cutter set 1 TRSPA enzymes Haelll, Mbol, and Mset 

The plasmid pNW35 (shown beiow) was constructed to 
contain an insert with ail of the 4 bp cutter TRSPA enzyme sites. 

Hirtdlii (52) 




5 

Hindlll and EcoRI sites define the outer ends of the 25 bp and 
the 40 bp fragments. The sequence of pNW35 is shown betow with the 
inserted region shown in bold type: 

10 atgaccatgaitacgccaagctctaatacgactcactatagggaaagcttccggacgtctcaggctaatgttggcccacc 
gacgttccacgatggggcgctcttaagggcttagaccctcgtcgggagtatttctgtgatctggcgacactcacgcg 
agaagtcattaccggcgatatgaattcactggccgtcgttttacaacgtcgtgactgggaaaaccctggcgttacccaactta 
aicgccttgcagcacatccccctttcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcg 
cagcctgaatggcgaatgggaaattgtaaacgttaatattttgttaaaattcgcgttaaatttttgttaaatcagctcattttttaacca 

15 ataggccgaaatcggcaaaatcccttaiaaatcaaaagaatagaccgagatagggttgagtgttgttccagtttggaacaag 
agtccactattaaagaacgtggactccaacgtcaaagggcgaaaaaccgtctatcagggcgatggcccactacgtgaacc 
atcaccctaatcaagttttttggggtcgaggtgccgtaaagcactaaatcggaaccctaaagggagcccccgatttagagcttg 
acggggaaagccggcgaacgtggcgagaaaggaagggaagaaagcgaaaggagcgggcgctagggcgctggcaa 
gtgtagcggtcacgctgcgcgtaaccaccacacccgccgcgcttaatgcgccgctacagggcgcgtcaggtggcacttttcg 

20 gggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaat 
gcttcaataatattgaaaaaggaagagtatgagtaitcaacatttccgtgtcgcccttattcccttttttgcggcattttgccttcctgttt 
ttgctcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactggatctc 
aacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggcgcggt 



WO 00/55364 



PCT/GB00/00916 



- 122- 



attatcccgtattgacgccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtc 
acagaaaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggcc 
aacttacttctgacaacgaicggaggaccgaaggagctaaccgcttttttgcacaacatgggggatcatgiaactcgccttgat 
cgttgggaaccggagctgaatgaagccataccaaacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgtt 

5 gcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagttgcag 
gaccacttctgcgctcggcccttccggctggctggtttattgctgataaatctggagccggtgagcgtgggtctcgcggtatcatt 
gcagcactggggccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggatgaacga 
aatagacagatcgctgagataggtgcctcactgattaagcatiggtaactgtcagaccaagtttactcatatatactttagattgat 
ttaaaacttcatttttaatttaaaaggatctaggtgaagatccmtigataatctcatgaccaaaatcccttaacgtgagttttcgttcc 

to acigagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgclgcttgcaaacaaaa 
aaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactcttmccgaaggtaactggcttcagcagagcg 
cagataccaaatactgtccttctagtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctc 
tgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggacicaagacgatagttaccggataa 
ggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacclacaccgaactgagatac 

15 ctacagcgtgagciatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcg 
gaacaggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgactt 
gagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggcc 
ttttgctggccttttgctcacaigttctttcctgcgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgc 
tcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctc 

20 tccccgcgcgttggccgattcattaaigcagctggcacgacaggtttcccgactggaaagcgggcagtgagcgcaacgcaat 
taatgtgagttagctcactcattaggcaccccaggctttacactttatgcttccggctcgtatgttgtgtggaattgtgagcggataa 
caatttcacacaggaaacagct 

The 4 bp internal control piasmid for restriction enzymes 
25 Haefil, Mbol, and Msell was prepared by the insertion of a synthetic 130 bp 
fragment into Hindlll / EcoRI-digested pMOSblue (Amersham Pharmacia 
Biotech). 

The insert region of 130 bp was sequenced twice in each 
direction in order to establish the piasmid sequence. The presence of the 
30 restriction sites and the mobility of the fragments released were also 
checked by restriction digestion and poiyacrylamide gel electrophoresis. 



35 



Preparation of the internal control spike DNA from the piasmid 

The following illustration describes the strategy used for the 
preparation of EcoRI-reieased internal control DNA: 
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pNW35 

DNA | 
crude bio 



PCR product 

SA-captured 
PCR product 



PCR product 



RE digest 

4 bp cutter internal = 
control for RE diqests 



t 

! 



+ dNTP 



I 

A-captured fr-SA-hio . . , + 

i 

clean #-SA-bio_ 



Primers and PCR 

5 U-1 9 mer bio primer 5' bio-GTTTTCCCAGTCACGACGT 3' 
ICPCR{F) primer 5' TCCGGACGTCTCAGGCTAATGTT 3' 

Bulk PCR amplification (96x 100 (il reactions, repeated to 
id give 192 reactions in total) was carried out according to the conditions 
described below {all volumes are in 





xl Oil) 


X100 ftd) 


Final conditio*^ 


Water 


84.4 


8440 




10x PCR buffer 


10 


tooo 


1x 


25 mM dNTP mix 


0.8 


80 


200 


20 jjM T7 promoter-bio primer 


1 


100 


20 pmoi per PCR 


20 mM U19 mer-bio primer 


1 


100 


20 pmol per PCR 


1 ng/|il pNW35 


2 


200 


2 ng per PCR 


5 U/jliI Taq polymerase 


0.8 


80 


4U per PCR 


AV-\^. fc \ Totals v ^ 
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The PCR master mix was rapidly dispensed into 96 PCR 
tubes. Thermal cycling was initiated using the following parameters; 94°C 
for 2 min; 50 C C for 2 min; 29 cycles of 72 C C for 2 min, 94°C for 45 sec, and 
50°C for 1 min; 72 C C for 8 min; and then 4°C. 

Capture of biotinylated PCR product termini and Ecofl/- release of 
internal control DNA 

All separations were carried out using a Dynal MPC-1 
separator (Dynal, product #12001). 
m 20 ml of pooled PCR reaction were mixed with 20 ml of 

Dynabeads M-280 in 20 mM tris-HC! (pH 7.4), 2 mM EDTA (pH 8.0), 2 M 
NaCL The tube was incubated at room temperature for 1 hour with mixing 
on a Denley Spiramix 5. 

The Dynabeads were then washed four times with 20 ml of 1 0 
15 mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 1 M NaCI. A fifth wash was 
performed in 20 ml of 1x buffer M. 

EcoRI-digestion was carried out for 1 hour at 37°C in 5 mi of 
0.25 U/ui EcoRI in 1x buffer M. The digest was then divided into ten 500 
ul aliquot parts. Each aliquot was ethanol precipitated by the addition of 1 
20 jal of See DNA, 0.1 volume of 3 M sodium acetate (pH 5.2), and 2.5 

volumes of ethanol. The precipitations were mixed and chilled to 0°C on 
ice for 30 minutes and then centrifugation at 20,000 maxRCF for 10 
minutes. The pellets were rinsed with 70 % ethanol before dissolving in a 
total of 500 ul of 1x TE buffer. 
25 A 1 aliquot of the EcoRI-released internal control DNA was 

electrophoresed on an 8 % polyacrylamide gel to confirm that the size of 
the purified DNA was in accordance with that expected. 
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High molecular weight genomic DNA digestion in the presence of the 
4 bp cutter internal control DNA with a dilution series of the enzymes 
Haelll, Msel, and Mbol 

High molecular weight human placental DNA was mixed with 
5 a dilution series of a mixture of the three set 1 4 bp cutter restriction 
endonucleases - each at the same number of units. Aliquots were then 
removed and mixed with the EcoRI-released internal control DNA. 3.6 p.g 
of Human placental DNA (Sigma) was digested with 0.5 U/jif, 0.1 U/jliI, 0.02 
U/ul, 0.004 U/^iL and 0 U/ ( ul each of Haeill, Mbol, and Msel in a total 
m volume of 36 ul. 0.4 ,ug of placental DNA and 1 of EcoRI-released 

Internal control DNA were digested with 0.5 U/^il, 0.1 U/jai, 0.02 U/jil, 0.004 
U/^il, and 0 U/ul each of Haelil, Mbol, and Msel in a total volume of 4 jiL 
Vistra Green (Amersham Pharmacia Biotech) staining was used to monitor 
the extent of internal control DNA digestion and high molecular weight DNA 
1 5 digestion. 

A premix of restriction enzymes (5 U/jjlI each enzyme), buffer, 
and BSA was prepared as described below: 



Component 




50 Ui[i\ HaeM 


1 


20 U/mJ Msel 


2.5 


25 U/jil Mbol 


2 


1 Ox NEB buffer #2 


1 


10x BSA 


1 


Water 


2.5 


Total 


10 



:o Serial 5-fold dilutions of the premix were prepared in 1x NEB 

buffer #2 and 1xBSA. 

A 6x premix of human placental DNA, buffer, and BSA were 
also prepared as described below: 
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! Component 


6x mix (pi) 


per reaction (pi) 


1 1 mg/ml human placental DNA 


24 


4 


j 10x NEB buffer #2 


9.6 


j 1.6 


i 10xBSA 


9.6 


1.6 


| Water 


52.8 


j 8.8 



This premix contained placental DNA at 0.25 mg/mi in 1x 
NEB buffer #2 and 1xBSA. 

Human placenta! DNA and restriction mixes were then set up 

5 as follows: 



; Tube 


Component #1 


Component #2 


1 


16 pi of placental DNA at 0.25 mg/ml m 
1 x NEB buffer #2 and 1 x BSA 


4 iil of each restriction enzyme at 5 U/pi 
in 1x NEB buffer #2 and 1x BSA 


2 


16 ul of placental DNA at 0.25 mg/ml in 
1x NEB buffer #2 and 1x BSA 


4 ( ul of each restriction enzyme at 1 U/pl 
in 1 x NEB buffer #2 and 1 x BSA 


3 


16 of placental DNA at 0.25 mg/ml in 
1x NEB buffer #2 and 1x BSA 


4 pi of each restriction enzyme at 0.2 
U/jmi in 1x NEB buffer #2 and 1x BSA 


4 


1 6 pi of placental DNA at 0.25 mg/ml in 
1 x NEB buffer #2 and 1 x BSA 


4 p! of each restriction enzyme at 0.04 
U/pl in tx NEB buffer #2 and 1x BSA 


5 


1 6 pf of placental DNA at 0.25 mg/ml in 
1x NEB buffer #2 and 1x BSA 


4 pi of 1x NEB buffer #2 and 1x BSA 



2 pi aliquots were then removed from tubes 1-5 and added to 
1 pi of EcoRI-released internal control DNA and 1 |xl of 2x NEB buffer #2 
10 and 2x BSA to give samples 1 -Sic. Samples 1 -Sic were then overlaid with 
50 jul of mineral oil in order to prevent evaporation. 

The remaining volume from tubes 1-5 was then added to 
18 pi of 1x NEB buffer #2 and 1x BSA. 

All samples were finally incubated at 37°C overnight. 



15 
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Restriction digests - final conditions 

Restriction digests 1-5 were therefore performed under the 
following conditions: 



lube 


iig human 


U 


u 


u 


N£B 


ug/mi 




placental 


Haeiif 


Mse! 


Mbol 


buffer #2 


BSA 




DNA 












1 


3.5 


20 


20 


20 


IX 


100 


2 


3.6 


4 


4 


4 


1x 


100 


3 


3.6 


o.a 


0.8 


08 


1X 


100 


4 


3.6 


0.16 


0.16 


0.16 


1X 


100 


5 


3.6 


0 


0 


0 


1x 


100 



Total volume = 36 ul 

Likewise, restriction digests 1 -Sic were performed under the 
following conditions; 

10 



tube 


pg human 


Hi 


U 


U 


U 


NEB 


ug/ml 




placenta! 


IC 


Haefli 


Msei 


Mbol 


buffer #2 


BSA 




DMA 


0NA 












1 


0.4 




2 


2 


2 


1x 


100 


2 


04 




0.4 


0.4 


0.4 


U 


100 


3 


0.4 




0.08 


0.08 


0.08 


u 


100 


4 


0.4 




0.016 


0.016 


0.016 


1X 


100 


5 


04 




0 


0 


0 


1X 


100 



IC = Ecoff/-released internal control DNA 
Total volume = 4 jjl! 

is 

10 |il of digests 1-5 were each mixed with 3 ul of loading dye 
and 4 jul of digests 1 -Sic were mixed with 1 jal of loading dye. To sample 
number Sic, 1 80 ng of PGR molecular weight markers were added to serve 
as size standards. The band sizes for these markers are 50 bp, 100 bp, 
20 200 bp, 300 bp, 400 bp, 500 bp, 525 bp, 700 bp, and 1000 bp. The digests 
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were then electrophoresed on an 8 % polyacryiamide gel in 1x TBE as 
described beiow; 

10 ml of 10x TBE 
5 1 9.5 m) of 40 % acryiamide 

19 ml of 2 % methylene bisacrylamide 
50.4 ml of water 

1 mi of freshly prepared 10 % (w/v) ammonium persulphate 
100 ul of TEMED 

iO 

A Cambridge Electrophoresis Ltd. vertical protein 
electrophoresis unit was used with 1 mm plate spacing. The samples were 
electrophoresed at 30 mA for 2 hours in 1x TBE. The gel was then stained 
for 30 minutes in 500 mi of 1x TBE containing 50 jil of Vistra Green. The 
1 5 stained gel was imaged on a Fiuorimager with the following settings: a 488 
nm laser; a 570 DF 30 filter; a PMT setting of 700 V; 200 [im resolution; 
and low sensitivity. 

The results are shown in figure 3. 

20 Example 2 

TRSPA-2 analysis with the pNW33 BamHI, Hindlll, and Mill matrix and 
the pNW33 Hindlll, Wco/, and Spel matrix probed with the 140 bp 
BspBl to BamHf fragment from pNW33 

25 As an exampie of the case where the probe hybridizes to an 

arrayed PCR fragment with a different restriction site at each end, the 
pNW33 BamHI, Hindlll, and Aflll matrix (matrix #7) was probed with a PCR 
product from the 140 bp BspEI (466) to BamHI (605) restriction fragment 
within pNW33. In this example, the probe binds to a 204 bp Hindlll to 

30 BamHI PCR product derived from the 160 bp Hindlll (445) to BamHI (605) 
restriction fragment within pNW33 (see below). 
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As an example of the case where the probe hybridizes to an 
5 arrayed PCR fragment with the same restriction site at each end, the 

pNW33 Hindlil, Ncol, and Spei matrix (matrix #17) was probed with a PCR 
product from the 140 bp BspEI (466) to BamHI (605) restriction fragment 
within pNW33. In this example, the probe binds to a 514 bp Hindlil to 
Hindlil PCR product derived from the 470 bp Hindlil (445) to Hindlil (915) 
io restriction fragment within pNW33 (see above). 



Oligonucleotides 



BamHI short PCR primer 5' TGTAACGACACATTGCTGGATACC 3' 

15 Hindlil short PCR primer 5' ATATAACTCTCGCTCCTTGATAAC 3* 

Ncol short PCR primer 5' AGGCGTCTGAGGCTGCGGCTATGG 3' 

Spel short PCR primer 5' AACCCGTCGCGACGAGAGTCTAAG 3' 

Aflll short PCR primer 5' GATATACGTGATATATTTTGATTG 3' 



20 BamHI adaptor 5' pGATCGGTATCCAGCAATGTGTCGTTACA 3' 

Hindlil adaptor 5' pAGCTGTTATCAAGGAGCGAGAGTTATAT 3* 

Ncol adaptor 5' pCATGCCATAGCCGCAGCCTCAGACGCCT 3' 

Spel adaptor 5' pCTAGCTTAGACTCTCGTCGCGACGGGTT 3' 
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Ailll adaptor 



5' pTTAACAATCAAAATATATCACGTATATC 3' 



BamHl long PCR primer 5' TGTAACGACACATTGCTGGATACCGATCC 3' 

Hindi II long PCR primer 5' ATATAACTCTCGCTCCTTGATAACAGCTT 3' 

Ncol long PCR primer 5' AGGCGTCTGAGGCTGCGGCTATGGCATGG 3' 

Spel long PCR primer 5' AACCCGTCGCGACGAGAGTCTAAGCTAGT 3' 

Aflll long PCR primer 5' GATATACGTGATATATTTTGATTGTTAAG 3' 



Luc140down primer 
10 5' GCGCTAGGGATCCTTACTGGGACGAAGACGAA 3' 



Luc140up-bio primer 

5' biotin-CGCAGCTGGTAATCCGGACGCCCGCGTCGAAGATGTT3' 



15 Restriction digestion of pNW33 

Restriction digests of pNW33 were set up by combining the 

following: 
Matrix #7 



450 pi 22 j.ig/ml pNW33 in 1 x NEB buffer #2 + 1 00 ug/ml BSA 


50 nl 


0.6 U/hI BamHl in 1x NEB buffer #2 + 100 ug/ml BSA 


50 Hi 


0.6 U/ul Hindlll'm 1x NEB buffer #2 + 100 ug/ml BSA 


50 Hi 


0.6 U/hI Aflll in 1x NEB buffer #2 + 100 jig/ml BSA 



Matrix #17 



450 Hi 


22 ug/ml pNW33 in 1x NEB buffer #2 + 100 ug/ml BSA 


50 Hi 


0.6 U/u-l Hindlll in 1x NEB buffer #2 + 1 00 fig/ml BSA 


50 Hi 


0.6 U/\i\ Ncol in 1x NEB buffer #2+100 jig/ml BSA 


50 Hi 


0.6 U/ul Spel in 1 x NEB buffer #2+100 (ig/ml BSA 
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The samples were then vortex mixed, and incubated at 37°C 

overnight. 

Calf intestinal alkaline phosphatase (CIAP) digestion of pNW33 
5 restriction digests 

400 ul fractions of the 20 restriction digests (each containing 
6.6 pg of digested pNW33) were ethanol precipitated by the addition of 1 pi 
of See DNA, 0.1 volume of 3 M sodium acetate (pH 5.2), and 2.5 volumes 
of ethanol, chilling to -20°C, and then centrifugation. Pellets were rinsed 
in with 70 % ethanoi prior to re-dissoiution in 20 jjl! of 1x CIAP buffer 

containing 40 U of CIAP. The CIAP digests were carried out for 5 hours at 
37°C and were then made up to 400 ul with 1 x TE buffer. 

Phenol extraction of pNW33 CIAP digests 

15 The diluted digests were extracted with 400 pi of phenol and 

then ethanol precipitated as described above but with a 100 % ethanol 
wash after the 70 % ethanol wash. Samples were finally re-dissolved in 
20 pi of TE buffer. 

20 Annealing of short PCR primers to cognate adaptors 

Short PCR primers and their cognate adaptors were annealed 
by adding 1 pi of 200 pM short PCR primer to 1 pi of 200 pM cognate 
adaptor in 20 pi of 50 mM NaCI, 1x TE buffer. The mixed oligonucleotides 
were overlaid with 30 pi of light mineral oil and were then heated to 90°C 
25 for 5 minutes followed by slow cooling to room temperature. The annealed 
short PCR primer / cognate adaptor complexes were then diluted with 1 mi 
of 1x TE buffer and stored frozen at -20°C, 

Ligation to annealed short PCR primers and cognate adaptors 

30 1 pi of phenol extracted pNW33 digest was used per ligation. 

Ligations were performed in 100 pi of 1x iigase buffer containing 1 mM ATP 
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and 10 uf (100 U) of T4 DNA ligase. 20 pi aitquots from the 1 mi of 
annealed short PGR primer / cognate adaptor complexes were added 
according to the following table. 



#7 


BamHl f Hinollll Affli 


#17 


HtndM I Ncol 1 Spel 



5 

Ligation reactions were carried out for 24 hours at 16°C, 
Samples were then diluted to 1 mi with TE buffer and stored at -2Q°C. 

The diluted ligation reactions were then further diluted 1 in 10 
and 10 pi was used as PCR template per 100 pi reaction. 

io 

Restriction digestion of human placental DNA 

1 U (^50 pg) of human placental DNA was digested overnight 
at 37°C with 100 U each of BamHl, BsrGI, Hindi!!, Ncol, Spel, and Aflitin 
400 pi of 1x NEB buffer #2 containing 100 pg/ml BSA. 

15 

CIAP digestion of human placental DNA restriction digest 

The digest was ethanol precipitated by the addition of 1 jii of 
See DNA, 0.1 volume of 3 M sodium acetate (pH 5.2), and 2.5 volumes of 
ethanol, chilling to -20°C } and then centrifugation. The pellet was rinsed 
:o with 70 % ethanol prior to re-dissolution in 50 pi of 1x CIAP buffer 

containing 100 U of CIAP. The CIAP digest was carried out for 5 hours at 
37°C and was then made up to 400 pi with 1x TE buffer. 

Phenol extraction of human placental DNA CIAP digest 

25 The diluted CIAP digest was extracted with 400 pi of phenol 

and then ethanol precipitated as described above - again with a 100 % 
ethanol wash after the 70 % ethanoi wash. The sample was finally re- 
dissolved in 10 pi of TE buffer. 
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Ligation to annealed short PCR primers and cognate adaptors 

Short PCR primers were annealed to their cognate adaptors 
as described above. 

The ligation to annealed short PCR primers and cognate 
? adaptors was carried out in 100 pi of 1x ligase buffer with 1 mM ATP. 100 
U of T4 DNA iigase and 10 pi of each of the six short PCR primer / cognate 
adaptor complexes as above. 

The ligation reaction was carried out for 24 hours at 16°C. 
The sample was then diluted to 1 mi with 1x TE buffer and stored at -20°C. 
10 0.2 pi was used as PCR template per 100 pi reaction. 

PCR amplification conditions 

An initial touch-down reaction was carried out in 50 pi of 1x 
PCR buffer with all four dNTPs at 200 pM and Taq DNA polymerase at 
15 0,05 U/pl. Long PCR primers were used at 400 nM. 10 pi of pNW33 PCR 
template was used per reaction and 0.2 pi of human placental DNA PCR 
template was used per reaction. The samples were overlaid with 40 pi of 
Sight mineral oil and were touch-down thermocycled as described below: 



20 98°C for 1 mm, 72°C for 2 min, 72°C for 5 min 
98°C for 1 min, 69°C for 2 min, 72°C for 5 min 
98°C for 1 min, 66°C for 2 min, 72°C for 5 min 
98°C for 1 min, 63°C for 2 min, 72°C for 5 min 
98°C for 1 min, 60°C for 2 min, 72°C for 5 min 

25 

The main thermal cycling was then carried out in 100 pi of 1x 
PCR buffer with ail four dNTPs at 200 and Taq DNA polymerase at 
0.05 U/pl. Long PCR primers were used at 400 nM. The samples were 
subjected to 20 cycles of 98°C for 1 min, 60°C for 2 min, 72°C for 5 min 
30 and then 72°C for 10 min followed by chilling at 10°C and recovery from 
under oil. 
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Arraying onto nylon membranes 

1 ul aiiquots from the PGR amplified samples were spotted 
onto Hybond N+ nylon membranes. The membranes were then transferred 
5 to a stack of three sheets of 3MM paper, saturated with 0.4 M NaOH, and 
incubated for 10 minutes. The NaOH was rinsed away in 2x SSC and the 
membranes were used directiy for the pre-hybridization. 

Probe synthesis 

io A PCR master mix was prepared as foitows (ail volumes are 

in \x\): 



Master Mix Preparation | 140 


Water 


183.6 


10x PCR buffer 


30 


2 mM dATP 


15 


2 mM dGTP 


15 


2 mM dllP 


15 


2 mM dCTP 


3 


1 40 probe template 


1 


Luc140up-bio 


2.5 


Lud40down 


2.5 


Taq DNA polymerase (5 U/\i\) 


2.4 


Total 


270 



Five PCR reactions were performed. For each reaction, 5 (il 
\5 of 33P-iabelied dCTP was added to the PCR tube. The reactions were 
then made up to 50 \i\ by the addition of 45 |nl of the master mix to each 
tube. Each reaction was gently mixed. 
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PCR cycling parameters 

94°C 2 min 

50 °C 2 min 

5 

72 C C 2 min 

94°C 45 sec 

50 °C 1 min 

72°C 8 min 

4°C hold 

10 After thermal cycling, the five PCR amplifications were 

pooled. The 250 jil of labelled DNA was then used for the following 
magnetic bead purification procedure. 

Capture of biotinylated PCR product and release of non-biotinylated 
i5 strands 

All separations were earned out using a Dyna) MPC-4 
separator (Dynal 5 product #12004). 

250 jii of the pooled PCR was mixed with an equal volume of 
10 mg/ml streptavidin-coated colloidal Fe304 particles in 20 mM tris-HCI 
20 (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCL The tube was incubated at room 
temperature for 1 hour with mixing on a Denley Orbital Mixer 

The streptavidin-coated colloidal Fe304 particles were then 
washed with 1 ml of 10 mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 1 M 
NaCI. Three more identical washes were performed. 
25 The washed streptavidin-coated colloidal Fe304 particles 

were incubated in 500 ^l of 0.1 M NaOH for 10 minutes at room 
temperature. The supernatant was removed and added to 500 jxi of 0.5 M 



} 



25 cycles 



WO 00/55364 



PCT/GBOO/00916 



- 136 - 

HEPES. The samples were then ethanol precipitated by the addition of 0.1 
volume of 3 M sodium acetate (pH 5,2) and 2.5 volumes of ethanol. chilling 
to 0 C C (on ice for 30 minutes), and then centrifugation at 20,000 maxRCF 
for 10 minutes. The pellets were rinsed with 70 % ethanol before 
5 dissolving in 100 f.il of TE buffer. 

Hybridization 

Each membrane was placed in a 55 mm x 35 mm x 21 mm 
plastic box and 1.25 mi of pre-hybridization solution (5x SSC; Denhardt's 

io solution; 1 % SDS; 10 % dextran sulphate [Mw 500,000]; 0.3 % 

tetrasodium pyrophosphate; 100 jig/ml denatured, sonicated DNA- pre* 
warmed to 65°C) was added. Each box was closed and incubated at 65°C 
for 50 minutes on a rocking platform. The pre-hybridization solution was 
removed and replaced with hybridization solution (5x SSC; Denhardt's 

J5 solution; 1 % SDS; 10 % dextran sulphate [Mw 500,000]; 0.3 % 
tetrasodium pyrophosphate; 100 fig/ml denatured, sonicated DNA - 
containing 5 \x\ of the appropriate 33P-labelled probe) and the box was 
incubated at 65°C for 3 hours on a rocking platform. 

20 Washing 

The membranes were drained and transferred to 200 ml of 2x 
SSC, 0.1 % SDS at 68°C for 30 minutes. A further wash was carried out in 
0.2x SSC, 0.1 % SDS at 71 °C for 30 minutes. The membranes were 
rinsed in 2x SSC at room temperature and laid out on blotting paper to 
25 remove excess liquid. Once dry, the membrane was covered in Saran 
Wrap and exposed to a Kodak Phosphor Screen for 1 hour. The phosphor 
screen was subsequently imaged using a Molecular Dynamics Storm 860 
Phosphorimager. 

Results are shown in figures 4 and 5, 
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Example #3a 

A single cycle of inter-population perfectly matched duplex depletion 
wherein E.coli MutS protein is used to capture an A/A mismatch- 
5 containing duplex 

'Affected' versus 'unaffected' (i.e. inter-population) mismatch- 
containing duplex selection can be achieved by: attaching a mismatch- 
binding protein to a solid support (or using the mismatch-binding protein in 
solution followed by subsequent solid-phase capture); taking denatured 
lo 'affected' DNA fragments and hybridizing these to denatured and 
biotinyfated 'unaffected' DNA fragments; and capture of mismatch- 
containing duplex molecules with the mismatch-binding protein. Releasing 
the mismatch-containing duplex molecules (without strand denaturation), 
streptavidin capture and then release of the non-biotinyiated strands will 
15 give only the desired species. 

in this example, PGR fragments are prepared and used to 
demonstrate each of the individual steps for a single cycie of inter- 
population perfectly matched duplex depletion using E.coli MutS protein. 

20 Clone insert design 

The clone inserts were constructed using standard cloning 
methodology weil known to those skilled in the art and were inserted 
between the Aval site and EcoRl of pMOSS/ue {Amersham Pharmacia 
Biotech). 

25 The clone inserts contain a common 9 base pair internal core 

sequence in which a single nucleotide change or an insertion can be 
located. The internal core sequence is derived from codons 272-274 of 
human p53. These codons (GTG CGT GGT) correspond to a mutational 
hotspot found in lung and other types of cancer (R273L). For the design of 

30 the clone inserts containing a mismatch (only one from a complete series of 
which is shown below), the nucleotide in position 5 of this core sequence is 
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modified (GTGCXTGGT). 

The internal core sequence is flanked by a random sequence 
- allowing the independent detection of the clone #1 and the clone #7 
insert sequence in a mixed population of done inserts. 

5 

Clone #1 

Mutant sequence (#1M) 

1 0 5 ' CCCGGGGGATCCTCGTTTTATTGGGCCGAGTTTTGGTCCGTAGTGCTTGGTTAGATATGCTTAT 
3 ' GGGCCCCCTAGGAGCAAAATAACCCGGCTCAAAACCAGGCATCACGAACCAATCTATACGAATA 

GTTCACAAAATCATCCTTGTACAGAATTC3 ' 
C AAGTGTTTTAGTAGG AACATGTCTTAAG 5 1 

15 

Control sequence (#1C) 

5 f CCCGGGGGATCCTCGTTTTATTGGGCCGAGTTTTGGTCCGTAGTGCATGGTTAGATATGCTTAT 
3 ' GGGCCCCCTAGGAGCAAAATAACCCGGCTCAAAACCAGGCATCACGTACCAATCTATACGAATA 

20 

GTTCACAAAATCATCCTTGTACAGAATTC3 ' 
CAAGTGTTTTAGTAGGAACATGTCTTAAG5 ' 

Clone #7 

25 

Control sequence (#7C) 

5 ' CCCGGGTGTACACAAAAGTTTACCTGAAGAACGTGGGGGGTCGTGCCTGGTCTTGCGTCACCTG 
3 "GGGCCCACATGTGTTTTCAAATGGACTTCTTGCACCCCCCAGCACGGACCAGAACGCAGTGGAC 

30 

GTCTCAGGAGAGGGTCCCCATGGGAATTC3 1 
CAGAGTCCTCTCCCAGGGGTACCCTTAAG 5 ' 



35 
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Preparation of upper strand S'-biotinylated and lower strand 5'- 
biotinylated #1C double-stranded PCR product, and upper strand 5 1 
biotinylated and lower strand S'-biotinylated #7C double-stranded 
PCR product 

5 

Oligonucleotides 

BIOUPST2 5' bio-CTACTGATCGGATCCCCG 3' 

BIODOWN3 5' bio-AAACGACGGCCAGTGAAT 3' 

io PCR reaction set-up (#1C) 

#1C in pMOSe/ue at 2.50 ug/ml 
Water 

10x PCR buffer 
J5 50 uM BIOUPST2 
50 uM BIODOWN3 
10mM dNTPs 

Taq DNA polymerase (5 U/ul) 

20 PCR reaction set-up (#7C) 

#7C in pMOSe/ue at 3.32 ug/ml 
Water 

10x PCR buffer 
25 50 uM BIOUPST2 
50 uM BIODOWN3 
10mM dNTPs 

Taq DNA polymerase (5 U/|il) 



100 ul 
8400 ul 
1000 ul 
100ul 
100 ul 
200ul 
100ul 



100ul 
8400 ul 
1000 ul 
100 ul 
100 ul 
200ul 
100 ui 
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96x 200 |il reactions were carried out for template #1C and 
96x 200 ul reactions were carried out for template #7C on a 96-weii Perkin 
Elmer Cetus GeneAmp PGR System 9600 machine as described below: 

95 °C f 5 minutes 1 cycle 
95 °C, 1 minute 
50 °C, 1 minute 30 cycles 
72 °C. 1 minute 
72 °C t 5 minutes 1 cycle 
4°C, hold 

The PCR products were pooled together and precipitated by 
adding 0.1 volumes of 3 M sodium acetate and 1 volume of isopropanol 
followed by centrifugation at 1 6.000 rpm tor 30 minutes at 4°C (in a 
Centrikon T-2070 ultracentrifuge; swinging bucket Kontron rotor TST 

m 41.14). Pellets were washed with 14 ml of ethanoi and centrifuged at 
20,000 rpm for 30 minutes. Finally, the pellets were air-dried and 
resuspended in a total volume of 0.6 ml of TE buffer. 

The 0.6 ml PCR sample was purified in twelve Microspin S- 
300 HR columns (50 jul per column) following the manufacturer's protocol. 

15 Briefly, the resin in the columns was resuspended by vortexing. Columns 
were centrifuged at 735 x g (3000 rpm in a microfuge) for 1 minute. The 
sample was then applied to the centre of the resin, being careful not to 
disturb the bed. The columns were centrifuged at 735 x g for 2 minutes 
and the flow-through containing the PCR product was collected. The 

20 twelve eluted 50 \x\ volumes were pooled together (pool 1 }. Columns were 
washed with 50 |ii of TE buffer and the eluted fractions were loaded onto a 
fresh S-300 HR column. Product yield and removal efficiency of the PCR 
primers were analysed on a 1 .5 % agarose gel. 



25 
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Preparation of non-biotinyiated #1M singie-stranded DNA and non- 
biotinyiated #7C single-stranded DNA 

Oligonucleotides 

5 

BIOUPST2 5' bio-CTACTGATCGGATCCCCG 3' 

DOWN3 5' AAACGACGGCCAGTGAAT 3' 

PCR reaction set-up (#1 M) 

io 

#1 M in pMOSBlue at 3.56 ug/mt 
Water 

10x PCR buffer 
50 uM BIOUPST2 
15 50 uM DOWN3 
10 mM dNTPs 

Tag DNA polymerase (5 U/ul) 
PCR reaction set-up (#7C) 

20 

#7C in pMOSBlue at 3.32 jig/ml 1 00 u) 

Water 8400 ul 

10x PCR buffer 1000ui 
50 uM BIOUPST2 100 jui 

25 50 uMDOWN3 100ul 
10 mM dNTPs 200ul 
Taq DNA polymerase (5 U/\i\) 1 00 ul 



100 jxl 
8400 ui 
1000 
100 ]x\ 
100 ul 
200ul 
100 ul 



30 



48x 200 ul reactions were carried out for template #1 M and 
48x 200 |il reactions were carried out for template #7C on a 96-wel! Perkin 
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Elmer Cetus GeneAmp PCR System 9600 machine as described below: 



95 °C, 5 minutes 1 cycle 
95 °C. 1 minute 
50 °C t 1 minute 30 cycles 
72°C f 1 minute 
72 °C, 5 minutes 1 cycle 
4°C f hold 



Capture of biotinylated PCR product strands and release of non- 
5 biotinylated strands for the preparation of non-biotinylated #1 M 
single-stranded DNA and non-biotinylated #7C single-stranded DNA 

10 ml of pooled #1M and #7C PCR amplifications were each 
mixed with an equal volume of 4 mg/ml streptavidin-coated colloidal Fe 3 0 4 
particles taken up in 20 mM tris-HCi (pH 7,4), 2 mM EDTA (pH 8.0), 2 M 
io NaCI. The tubes were incubated at room temperature for 60 minutes with 
mixing. 

The streptavidin-coated colloidal Fe 3 0 4 particles were then 
washed with 20 mi of 10 mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 1 M 
NaCI. Two more identical washes were performed. 
15 The washed streptavidin-coated colloidal Fe 3 0 4 particles 

were finally incubated in 800 ^1 of 0.1 M NaOH for 10 minutes at room 
temperature. The supernatants were removed and added to 200 \xl of 2 M 
HEPES (free acid). Samples were quantified by absorbance at 260 nm. 



20 Denaturation and annealing 

Denaturation and annealing reactions were prepared by 

mixing: 



25 
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A = 400 ng of upper strand S'-biotinylated and lower strand 5- 
biotinylated #1C double-stranded PGR product 
B = 400 ng of upper strand S'-biotinylated and lower strand S'- 
biotinylated #7C double-stranded PCR product 
C = 200 ng of non-biotinylated #1 M single-stranded DNA 
(the lower of the two #1 M strands shown above) 
D = 200 ng of non-biotiny!ated #7C single-stranded DNA 
(the lower of the two #7C strands shown above) 



Reannealing between the upper strand of A and the single- 
stranded C will therefore give rise to an A/A mismatch-containing duplex, 
5'-biotinyiated on the upper strand, for clone insert #1 . 

5 Reannealing between the upper strand of B and the single- 

stranded D will therefore give rise to a perfectly matched duplex, 
S'-biotinylated on the upper strand, for clone insert #7, 

0,1 volumes of 1 M NaOH were then added, followed by 
incubation at room temperature for 10 minutes. 0.25 volumes of 2 M 

k> HEPES (free acid) were finally added followed by incubation at 42°C for 
1 hour. 

Samples were adjusted to 50 |ji and were made 1x in PBS 
and 1 mg/ml in BSA ready for reaction with MutS protein-coated magnetic 
beads. 

15 One 50 pi sample (the pre-enrichment control, sample 6) was 

used directly for capture of biotinylated PCR product strands and release of 
non-biotinylated strands. 

Mismatch capture 

20 20 jil of M2B2 MutS protein-coated magnetic particles 

(Genecheck, lot #20) were added to the annealed DNA above. Samples 
were incubated for 1 hour at room temperature with shaking. 

Samples were then washed twice with 200 fjJ of ice-cold PBS. 



I" 
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Samples were finally eluted from the magnetic beads for 10 
minutes at room temperature in 50 ul of the following: 



Sample 1 


1 M NaCI 


Sample 2 


PBS 


Sample 3 


1 M urea 


Sample 4 


1%(w/v) SDS 


Sample 5 


10 mM NaOH 



? Capture of biotinytated PCR product strands and release of non- 
biotinyiated strands 

All separations were carried out using an Amersham 
magnetic separator (RPN1682, batch #1). 

50 \il of the eluates from the MutS protein-coated magnetic 
io beads and the pre-enrichment control (sample 6) were each mixed with an 
equal volume of 4 mg/mi streptavidin-coated colloidal Fe 3 0 4 particles in 20 
mM tris-HCI (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCI. The tubes were 
incubated at room temperature for 30 minutes with regular mixing. 

The streptavidin-coated colloidal Fe30 4 particles were then 
is washed twice with 500 jil of 10 mM tris-HCi (pH 7.4), 1 mM EDTA (pH 8.0), 
1 M NaCI at room temperature. 

The washed streptavidin-coated colloidal Fe30 4 particles 
were incubated in 10 jil of 0.1 M NaOH for 10 minutes at room 
temperature. The supernatant was removed and added to 2.5 ^l of 2 M 
20 HEPES (free acid). 

5 |il fractions were finally spotted onto Hybond N+ nylon 
membranes along with 200 ng, 20 ng, 2 ng, and 200 pg amounts of the 
following; 
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C = non-biotinylated #1M single-stranded DNA 
(the lower of the two #1M strands shown previously) 
D = norvbiotinyiated #7C singie-stranded DNA 
(the lower of the two #7C strands shown previously) 



Probe labelling 
Oligonucleotides 

5 

#1 probe oligo 5' GGCCGAGTTTTGGTCCGTAG 3' 
#7 probe oligo 5' GTCTTGCGTCACCTGGTCTCAG 3 5 

io Preparation of probes 

#1 probe oligo and #7 probe oiigo were radioactively 5' end- 
labeiled using T4 polynucleotide kinase as described below (alf volumes 
are in 





#1 probe 


#7 probe 


#1 probe oligo (25 pmol) 


1.25 




#7 probe oiigo (25 pmol) 




1.25 


10x PNK buffer 


2.5 


2.5 


[y- 33 P] ATP >92 TBq/mmol, >2500 Ci/mmol, 370 
mBq/ml, 10mCi/ml, 

(Amersham Pharmacia Biotech., AH9968, lot #B0006) 


12.5 


12.5 


T4PNK (1011/^1) 

(Amersham International, E70031Y, lot #201226) 


2.5 


2.5 


Water 


6.25 


6.25 



15 

The reactions were incubated at 37°C for 30 minutes and 
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then heated to 70°C for 5 minutes to denature the enzyme. 

MicroSpin G-25 column purification 

Two G-25 columns (APB 27-5325-01 , lot #9015325011) were 
5 resuspended by vortexing, and the bottom closures were snapped off as 
described in the manufacturer's instructions. A pre-spin of the columns 
was carried out for 1 minute at 730 maxRCF - 2670 rpm in a Hettich 
Zeutrifugen EBA 12 benchtop centrifuge - and the eluates were discarded. 
The 25 jii reactions were applied to each column, and the columns were 
io centrifuged for 2 minutes at 730 maxRCF. The eluates from the second 
spin were stored and used as probes. 

Hybridization 

Each membrane was placed in a 55 mm x 35 mm x 21 mm 
15 plastic box and 2.5 mi of pre-hybridization solution (5x SSC; Denhardf s 
solution; 1 % SDS; 10 % dextran sulphate [MW 500,000]; 0,3 % 
tetrasodium pyrophosphate; 100 jxg/ml denatured, sonicated DNA - pre- 
warmed to 42°C) was added. Each box was closed and incubated at 42 C C 
for 1 hour on a rocking platform. The pre-hybridization solution was 
:o removed and replaced with hybridization solution (5x SSC; Denhardt's 
solution; 1 % SDS; 10 % dextran sulphate [MW 500,000]; 0.3 % 
tetrasodium pyrophosphate; 100 \xg/ml denatured, sonicated DNA - 
containing 2.5 \i\ of the appropriate 33 P-labelled probe) and the box was 
incubated at 42°C overnight on a rocking platform. 

25 

Washing 

The membranes were drained and transferred to 200 mi of 2x 
SSC, 0.1 % SDS at 42°C for 10 minutes. A further wash was carried out in 
0.2x SSC, 0.1 % SDS at 42°C for 10 minutes. The membranes were 
30 rinsed in 2x SSC at room temperature and laid out on blotting paper to 
remove excess liquid. Once dry, the membrane was covered in Saran 
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Wrap and exposed to a Kodak Phosphor Screen for 1 hour. The phosphor 
screen was subsequently imaged using a Molecular Dynamics Storm 860 
Phosphorimager. 

Dot blot layout 



Sample 1 
5 faction 


Sample 2 
5 fiJ fraction 


Sampie 3 
5 jil fraction 


Sample 4 
5 ill fraction 


Sampie 5 
5 jil fraction 


Sample 6 
5 \i\ fraction 


200 ng 
#1MssDNA 


20 ng 

#1M ssDNA 


2 ng 

#1M ssDNA 


200 pg 
#1MssDNA 






200 ng 
#7C ssDNA 


20 ng 

#7C ssDNA 


2ng 

#7C ssDNA 


200 pg 
#7C ssDNA 







ssDNA = singie-stranded DNA 



Probe hybridisation results 



#1 probe 


#7 probe 
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Signal intensities from the spots above were reported using 
ImageQuant 5.0 software (Molecular Dynamics), with the SumAboveBG 
figures being used after drawing a 6x3 grid over the array of spots. 
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#1 probe SumAboveBG signals 



Sample i 
142257 


Sample 2 
24356 


Sample 3 
180359 


Sample 4 
394296 


Sample 5 
465242 


Sample 6 
658323 


200 ng 
#1M ssDNA 
15961011 


20 ng 
#1M ssDNA 
2842937 


2 ng 
#1M ssDNA 
252392 


200 pg 
#1MssDNA 
25316 


2281 


2475 


200 ng 
#7C ssDNA 
66536 


20 ng 
#7C ssDNA 
37955 


2 ng 
#7C ssDNA 
10621 


200 pg 
#7C ssDNA 
2736 


1955 


1934 


#7 probe SumAboveBG signals 


Sample 1 
66392 


Sample 2 
17071 


Sample 3 
118109 


Sample 4 
153372 


Sample 5 
166211 


Sample 6 
666853 


200 ng 
#1MssONA 
80873 


20 ng 
#1MssDNA 
23256 


2ng 
nMssDNA 
6371 


200 pg 
#1M ssDNA 
2399 


1831 


1857 


200 ng 
#7C ssDNA 
10619299 


20 ng 
#7C ssDNA 
2052027 


2ng 
#7C ssDNA 
207827 


200 pg 
#7C ssDNA 
15470 


2055 


1844 



Recovery 



Eiution conditions 


#1 signal 
(% input ssDNA) 


#7 signal 
{% input ssDNA) 


#1/#7 signal ratio 


1 M NaCl 


21.6% 


10.0% 


2.2 


PBS 


3.7 % 


2.6 % 


1.4 


1 M urea 


26.4 % 


17.7% 


1.5 


1 % SDS 


59.9 % 


23.0 % 


2.6 


10 mM NaOH 


70 .7 % 


24.9 % 


2.8 



Recovery figures are plotted below for the various different 
eiution conditions: 
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5 Example #3b 

A single cycle of inter-population perfectly matched duplex depletion 
wherein bacteriophage T4 endonuclease VIII protein containing a 
cieavage-inactivating N62D point mutation is used to capture an A/A 
io mismatch-containing duplex 

in this example, PCR fragments are again prepared and used 
to demonstrate each of the individual steps for a single cycle of inter- 
population perfectly matched duplex depletion using bacteriophage T4 
endonuclease Vill protein contain.ng a cleavage-inactivating N62D point 
15 mutation. 

Clone design and DNA preparation 

Clone insert design was exactly as described in example #3a 
(see clone #1: mutant sequence <#1M). control sequence (#1C) and clone 
20 #7: control sequence (#7C)). 
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Preparation of upper strand 5'-biotinyfated and lower strand 
S'-biotinylated #1C double-stranded PGR product, and upper strand 5'- 
biotinyiated and lower strand S'-biotinylated #7C double-stranded PGR 
product were as described in example #3a. 
5 Preparation of non-biotinylated #1 M singie-stranded DNA and 

non-biotinylated #7C single-stranded DNA were again as described in 
example #3a. 

Denaturation and annealing 

10 Denaturation and annealing reactions were prepared as 

described in example #3a. 

Reannealing will therefore again give rise to an A/A 
mismatch-containing duplex, S'-biotinylated on the upper strand, for clone 
insert #1 and a perfectly matched duplex, S'-biotinylated on the upper 
1 5 strand, for clone insert #7. 

One 50 jii sample (the pre-enrichment control) was adjusted 
to 100 |il with TE buffer and was used directly for capture of biotinylated 
PGR product strands and release of non-biotinylated strands. 

20 Mismatch capture 

50 \i\ samples of the annealing reaction were mixed with an 
equal volume of 200 mM sodium phosphate (pH 6,5), 100 mM KCL 10 jig 
of GST-tagged T4 endonuclease VII N62D mutant (obtained from Prof. 
Borries Kemper, Univ. Cologne) were added to the annealed DNA and the 
25 samples were incubated for 1 5 minutes at 16°C. 

Samples were then mixed with 200 iii of a 50 % slurry of 
Glutathione Sepharose 4B (Amersham Pharmacia Biotech, lot #279991) in 
100 mM sodium phosphate {pH 6,5), 50 mM KCl. The tubes were 
incubated at 16°C for 30 minutes with regular mixing. The mixture was 
30 then transferred to a spin column to separate solid from liquid phases. 

Samples were finally eluted from the Glutathione Sepharose 
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4B matrix for 10 minutes at room temperature in 100 of 10 mM reduced 
glutathione in 50 mM tris-HCI (pH 8.0). 

Capture of biotinyiated PCR product strands and release of non- 
5 biotinyiated strands 

All separations were carried out using an Amersham 
magnetic separator {RPN1682, batch #1), 

100 fil of the eluate from the N62D T4 endonuciease VII 
mismatch-capture reaction, and the pre-enrichment control were each 
io mixed with an equal volume of 4 mg/ml streptavidin-coated colloidal Fe304 
particles in 20 mM tris-HCI (pH 7.4), 2 mM EDTA (pH 8.0), 2 M NaCi. The 
tubes were incubated at room temperature for 30 minutes with regular 
mixing. 

The streptavidin-coated colloidal Fe30 4 particles were then 
\5 washed twice with 500 \x\ of 10 mM tris-HCI (pH 7.4), 1 mM EDTA (pH 8.0), 
1 M NaCI at room temperature. 

The washed streptavidin-coated colloidal Fe30 4 particles 
were incubated in 10 jil of 0.1 M NaOH for 10 minutes at room 
temperature. The supernatant was removed and added to 2.5 jil of 2 M 
20 HEPES (free acid). 

5 ill fractions were finally spotted onto Hybond N+ nylon 
membranes along with 200 ng, 20 ng, 2 ng, and 200 pg amounts of the 
following; 



C = non-biotinyiated #1M single-stranded DNA 
(the lower of the two #1M strands shown previously) 
D = non-biotinylated #7C single-stranded DNA 
(the lower of the two #7C strands shown previously) 

25 

Probe labelling 

Probe labelling was exactly as described in example #3a. 



WO 00/55364 



PCT/G BOO/0091 6 



-152- 



Hybridization 

Each membrane was placed in a 55 mm x 35 mm x 21 mm 
plastic box and 2.5 ml of pre-hybridization solution (5x SSC; Denhardt's 

5 solution: 1 % SDS; 10 % dextran sulphate [MW 500,000]; 0.3 % 

telrasodium pyrophosphate; 100 ^g/mi denatured, sonicated DNA - pre- 
warmed to 42°C) was added. Each box was closed and incubated at 42°C 
for 10 minutes on a rocking platform. The pre-hybridization solution was 
removed and replaced with hybridization solution (5x SSC; Denhardt's 

m solution; 1 % SDS; 10 % dextran sulphate [MW 500.000]; 0.3 % 
tetrasodium pyrophosphate; 100 jig/mi denatured, sonicated DNA - 
containing 2.5 jil of the appropriate 33 P-iabeiled probe) and the box was 
incubated at 42°C overnight on a rocking platform. 

\5 Washing 

The membranes were drained and transferred to 200 ml of 2x 
SSC, 0.1 % SDS at 42°C for 10 minutes. A further wash was carried out in 
Q,2x SSC, 0.1 % SDS at 42°C for 10 minutes. The membranes were 
rinsed in 2x SSC at room temperature and laid out on blotting paper to 

20 remove excess liquid. Once dry, the membrane was covered in Saran 
Wrap and exposed to a Kodak Phosphor Screen for 1 hour. The phosphor 
screen was subsequently imaged using a Molecular Dynamics Storm 860 
Phosphorimager. 

Signal intensities from the spots above were again reported 

25 using ImageGuant 5.0 software (Molecular Dynamics), with the 

SumAboveBG figures being used after drawing a grid over the array of 
spots. 
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#1 probe SumAboveBG signals 



200 ng 
#1MssDNA 
22418970 


20 ng 
#1MssDNA 
3317389 


2ng 
#1M ssDNA 
270664 


200 pg 
#1MssONA 
25763 


GSH eiuted 
sample 
144398 


200 ng 
#7C ssDNA 
229951 


20 ng 
#7C ssDNA 
105343 


2ng 
#7C ssDNA 
16354 


200 pg 
#7C ssDNA 
3632 


Pre- 
enrichment 
control 
997077 


#7 probe SumAboveBG signals 


200 ng 
#1MssDNA 
117267 


20 ng 
#1MssDNA 
42226 


2ng 
#1M ssDNA 
9421 


200 pg 
#1M ssDNA 
2812 


GSH eiuted 
sample 
113608 


200 ng 
#7C ssDNA 
3665967 


20 ng 
#7C ssDNA 
22248096 


2ng 
#7C ssDNA 
203869 


200 pg 
#7C ssDNA 
19406 


Pre- 
enriehment 
control 
1135786 



Recovery 



Eiutton conditions 


#1 signal 
(% input ssDNA) 


#7 signa! 
(% input ssDNA) 


#1/#7 signai ratio 


10 mM reduced 
glutathione in 50 mM 
tris-HCE (pH 8.0) 


14.5% 


10.0 % 


1.45 



it) The enrichment results are presented graphically below: 



iini hi ii' 'i i ' 



WO 00/55364 



- 154- 



PCT/GB00/00916 




Dedicated to the memory of Chris Griffin and Richard Beer 
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CLAIMS 

5 1 . A method of providing a mixture of DNA fragments enriched 

in fragments that are characteristic of a phenotype of interest, by providing 
affected DNA in fragmented form and providing unaffected DNA in 
fragmented form, which method comprises: 

a) mixing the fragments of the affected DNA and the fragments 
io of the unaffected DNA under hybridising conditions; 

b) recovering a mixture of hybrids that contain mismatches; 

c) recovering fragments of the affected DNA from the mixture of 
hybrids that contain mismatches; 

and optionally repeating steps a), b) and c) one or more 

J5 times. 

2. The method of claim 1 wherein the affected DNA is pooled 
DNA of individuals who show the phenotype of interest, and the unaffected 
DNA is pooled DNA of individuals who do not show the phenotype of 

20 interest. 

3. The method of claim 1 , wherein the affected DNA is DNA of 
one individual who shows the phenotype of interest, and the unaffected 
DNA is pooled DNA of individuals who do not show the phenotype of 

25 interest. 

4. The method of claim 1 , wherein the affected DNA is DNA of 
one individual who shows the phenotype of interest, and the unaffected 
DNA is pooled DNA of a complete set of ancestors who do not show the 

30 phenotype of interest. 
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5. The method of ciaim 1 , wherein the affected DNA is DNA 
from ceils of an individual that show the phenotype of interest, and the 
unaffected DNA is DNA from celts of the individual that do not show the 
phenotype of interest, 

5 

6. The method of any one of claims 1 to 5, wherein step b) is 
performed by use of a mismatch-binding protein. 

7. The method of any one of claims 1 to 6, wherein either the 

K) fragments of the affected DNA or the fragments of the unaffected DNA are 
tagged by one member of a specific binding pair, and step c) is performed 
by using the other member of the specific binding pair. 

8. The method of ciaim 7, wherein the fragments of the 

is unaffected DNA are tagged with biotin, and step c) is performed by use of 
immobilised streptavidin. 

9. The method of any one of claims 1 to 8, wherein the mixture 
of DNA fragments enriched in fragments that are characteristic of the 

20 phenotype of interest, is subjected to self-hybridisation followed by 
recovery of perfectly matched duplexes. 

10. The method of any one of claims 1 to 9, wherein the mixture 
of DNA fragments enriched in fragments that are characteristic of the 

25 phenotype of interest, is mixed with an excess of the fragments of the 
affected DNA under hybridisation conditions, followed by recovery of 
perfectly matched duplexes. 
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1 1 . The method of any one of claims 1 to 1 0. wherein each of the 
affected DNA and the unaffected DNA is provided in fragmented form by 
digestion with from 4 to 7 six-cutter restriction endonuciease enzymes 
together with from 0 to 5 four-cutter restriction endonuciease enzymes. 

5 

12. A mixture of DNA fragments enriched in fragments that are 
characteristic of the phenotype of interest, provided by the method of any 
one of claims 1 to 11 . 

it) 1 3. A method of making a set of arrays of fragments of DNA of 

interest, which method comprises: 

a) selecting, from a set of n restriction endonuciease enzymes, a 
subset of r restriction endonuciease enzymes; 

b) digesting genomic DNA with the subset of r enzymes; 

15 c) figating to the resulting fragments restriction-enzyme-cutting- 

site-specific adapters with unique polymerase chain reaction amplifiable 
sequences; 

d) splitting the resulting fragments into r 2 aiiquots; 

e) amplifying each aliquot with two restriction-enzyme-specific 
20 primers of which one is tagged; 

f) forming an array of the r 2 aiiquots of non-tagged 
amplimer strands; and 

g) repeating steps a) to f) using one or more different subsets of 
r restriction endonuciease enzymes. 



25 
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14. A method of making a set of arrays of fragments of DNA of 
interest, which method comprises: 

a) selecting, from a set of n restriction endonuclease enzymes, a 

subset of r restriction endonuclease enzymes; 
5 b) digesting genomic DNA with the subset of r enzymes; 

c) iigating to the resulting fragments restriction-enzyme-cutting- 
site-specific adapters with unique polymerase chain reaction amplifiable 
sequences; 

d) splitting the resulting fragments into r 2 aliquots; 

lo e) amplifying each aliquot with two restriction-enzyme-specific 

primers; 

f) forming an array of the r 2 aliquots of the amplimer strands; 
and 

g) repeating steps a) to f) using one or more different subsets of 
15 r restriction endonuclease enzymes. 

15. The method of claim 13 or claim 14 t wherein steps a to f) are 
repeated using each different subset of r restriction endonuclease enzymes 
to give (n!)/[{n-r)!r!] different arrays. 

20 

1 6. The method of any one of claims 1 3 to 1 5, wherein the n 
restriction endonuclease enzymes are selected from 4-cutters and 5-cutters 
and 6-cutters. 

25 17. The method of any one of claims 13 to 16, wherein n is 3 to 

10 and r is 2 to 4. 

1 8. The method of claim 1 7, where n = 6 and r = 3. 



30 



19. A set of arrays of fragments of DNA of interest, which set 

results from performance of the method of any one of claims 13 to 18. 
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20. The set of arrays of claim 19, which set results from 

performance of the method of claim 13 and claim 14 and claim 15. 

5 21 . The set of arrays of claim 19 or claim 20, derived from a set 

of n = 6 six-cutter restriction endonuclease enzymes which are BamHi; 
Bsr Gl; Hind lit; Ncol; Spei; and AfllL 

22. The set of arrays of claim 19 or claim 20, derived from the set 
io of n - 6 six-cutter restriction endonuclease enzymes which are EcoRI; 

BspHI; Bglll; Xbal; Acc651; and ApaLl. 

23. A nucleic acid characterisation method which comprises 
presenting to the set of arrays of any one of claims 1 9 to 22 a nucleic acid 

15 fragment of interest under hybridisation conditions, and observing a pattern 
of hybridisation. 

24. The method of claim 23 f wherein a plurality of nucleic acid 
fragments of interest are separately presented to the set of arrays, and the 

20 resulting patterns of hybridisation are compared. 

25. The method of claim 24, wherein the plurality of nucleic acid 
fragments of interest are drawn from the mixture of DNA fragments, 
enriched in fragments that are characteristic of a phenotype of interest, of 

25 claim 13. 

26. A method of identifying fragments of DNA that are 
characteristic of a phenotype of interest, which method comprises 
recovering, cloning and amplifying individual DNA fragments from the 

30 mixture of DNA fragments of claim 12, presenting the individual DNA 
fragments to the set of arrays of any one of claims 19 to 22 under 
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hybridisation conditions, observing a pattern of hybridisation generated by 
each individual DNA fragment, and subjecting to further investigation any 
two or more individual DNA fragments whose hybridisation patterns are 
similar or identical, or near to each other in a genome of interest. 

5 

27. A double-stranded DNA molecule having the sequence a-A-b- 
B...X~y-Y-z where A, B...X and Y are unique restriction sites for n different 
restriction endonuclease enzymes, and a T b...y, z denotes distances in 
base pairs, characterised in that each fragment, obtainable by cutting the 

io DNA molecule by means of any one or more up to n of the restriction 
enzymes, has a different length from every other fragment. 

28. The double-stranded DNA molecule of claim 27, wherein the 
following criteria are satisfied: 

15 a) inter-fragment length differences are greater for larger 

fragments; 

b) all possible fragments are unambiguously resolvable by 
electrophoresis from one another; 

c) size gaps between bands comprising different numbers of 
20 inter-restriction-site units are larger than size gaps between bands 

comprising the same number of inter-restriction-site units; 

d) the size gaps and size spread from the largest to the smallest 
fragment are electrophorectically compatible. 
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Figure 2 

Vfetra Green fluorescence 

1 \ig of Stratagene Kb DNA Ladder (band sizes: 250 bp, 500 bp, 750 bp, 1 kb, 1.5 kb, 2 kb, 3 kb, 4 kb, 
5 kb, 6 kb, 7 kb, 8 kb, 9 kb, 10 kb and 12 kb), 3x gaps, 2 ^ig of canine genomic DNA digested with 
0.25, 0,025, 0.0025, 0.00025, 0.000025, 0.0000025 and 0 U/jxl BamH1 1 BsrGf I Hindlil ! Ncoll Spell 
Aflll, gap, 0.4 jig of canine genomic DNA and 1 \x\ of Ssp£/-reieased internal control DNA digested 
with 0.25, 0,025, 0.0025, 0.00025, 0.000025, 0.0000025 and 0 U/jji BamH1 I BsrGf i Hindlil f NcoU 
Spell M III. 




From the results above, the fragment pattern from the internal control DNA is diagnostic of both the 
degree of digestion and the nature of any partial restriction at less than limit digestion. 

Very little digestion of canine genomic DNA and SspEZ-released internal control DNA is observed for 
0.00025 U/\xi BamH1 i BsrGU Hindlil I NcoU Spell Aflll and fewer units. With 0.0025 U/jii, BamHl 
and Aflll are both failing to cut more than the other enzymes (the 1 40 bp and 200 bp bands in 3ic are 
diagnostic of this). With 0.025 U/pi, most enzymes have cut - though not quite to completion. 0.25 
U/jii thus appears to be necessary in order to achieve limit digestion of 20 \ig of canine genomic DNA 
in a volume of 200 
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Figure 3 



Figure 3 shows the products of the restriction digests after electrophoresis on an 8 % poiyacryiamide 
gei. The numbers 1-5 indicate the dilution of enzyme mix used. 1=0.5 U/jai, 2=0.1 U/pi, 3=0.02 U/jii, 
4=0.004 U/jllI, and 5=0 \J/\xl 'ic 1 indicates the samples that were spiked with 4 bp cutter internal 
control DNA. PM indicates that 180 ng of PCR molecular weight markers were added as a size 
standard. 



All five of the partial digestion products are visible in lane ic4, between the top 130 bp band and the 
bottom 40 bp band, indicating that all three enzymes have failed to cut to completion at 0.004 U/\il In 
lane ic3, the four complete digestion products are all visible but the 65 bp band is also present. Haelll 
and Mbol have digested to completion but the Msel digestion is still partial at 0.2 U/\il At 0.1 U/\il and 
at 0.5 Uf\i\ all three enzymes have produced limit digests. 

Using the internal control spike, it is a great deal easier to determine the point at which limit digestion 
has occurred. The undigested internal control DNA is 1 30 bp. Partial digestion products generate 
bands of 105 bp, 90 bp, 75 bp, 65 bp, and 55 bp. The products of complete digestion generate bands 
of 40 bp* 35 bp, 30 bp, and 25 bp. The fragment pattern from the internal control DNA is therefore 
diagnostic of both the degree of digestion and the nature of any partial restriction at less than limit 
digestion. 
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Figure 4 

Expected TRSPA-2 pattern tor the pNW33 BamHl, Hindlll, and Aflll matrix probed with the 140 
bp BspEI to BamHl fragment from pNW33 

The TRSPA-2 pattern expected for matrix #7 {BamHl, Hindlll and Aflll) hybridized with a 33 P-Iabelled 
probe derived from the 140 bp BspEI- BamHHragment of pNW33 is shown below: 



#7 BamHl Hindlll AMI 



BamHl 



Hindlll 



Aflll 





HYB 




HYB 













Observed TRSPA-2 pattern for the pNW33 BamHl, Hindlll, and Aflll matrix probed with the 140 
bp BspEI to BamHl fragment from pNW33 

The TRSPA-2 pattern observed for matrix #7 (BamHl, Hindlll, and AfUf) hybridized with a 33 P-Iabelied 
probe derived from the 140 bp BspEI- BamHl fragment of pNW33 is shown below: 




1 
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Figure 5 

Expected TRSPA-2 pattern for the pNW33 Hindlll, Ncol, and Spel matrix probed with the 140 bp 
BspEI to BamHI fragment from pNW33 

The TRSPA-2 pattern expected for matrix #17 {Hindltl, Ncol, and Spef) hybridized with a 33 P-iabe!ied 
probe derived from the 140 bp BspEI- BamHI fragment of pNW33 is shown below: 



#17 Hindlll Ncol Spel 



Hindlll 



Ncol 



Spel 



HYB 


HYB 


HYB 


HYB 






HYB 







Observed TRSPA-2 pattern for the pNW33 Hindlll, Ncol, and Spel matrix probed with the 140 bp 
BspEI to BamHI fragment from pNW33 

The TRSPA-2 pattern expected for matrix #17 (Hindlll, Ncol, and Spef) hybridized with a 33 P-iabeiied 
probe derived from the 140 bp BspEI- BamHI fragment of pNW33 is shown below: 
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